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and the Behavior of the Simplex Method 

Abstract 

This dissertation investigates the geometric combinatorics of convex polytopes and 
connections to the behavior of the simplex method for linear programming. We focus our 
attention on transportation polytopes, which are sets of all tables of non- negative real 
numbers satisfying certain summation conditions. Transportation problems are, in many 
ways, the simplest kind of linear programs and thus have a rich combinatorial structure. First, 
we give new results on the diameters of certain classes of transportation polytopes and their 
relation to the Hirsch Conjecture, which asserts that the diameter of every d-dimensional 
convex polytope with n facets is bounded above by n — d. In particular, we prove a new 
quadratic upper bound on the diameter of 3-way axial transportation polytopes defined by 
1-marginals. We also show that the Hirsch Conjecture holds for px2 classical transportation 
polytopes, but that there are infinitely-many Hirsch-sharp classical transportation polytopes. 

Second, we present new results on subpolytopes of transportation polytopes. We 
investigate, for example, a non-regular triangulation of a subpolytope of the fourth Birkhoff 
polytope B4. This implies the existence of non-regular triangulations of all Birkhoff polytopes 
B n for n > 4. We also study certain classes of network flow polytopes and prove new linear 
upper bounds for their diameters. 

The thesis is organized as follows: Chapter [T] introduces polytopes and polyhedra and 
discusses their connection to optimization. A survey on transportation polytopes is presented 
here. We close the chapter by discussing the theory behind the software transportgen. 
Chapter [2] surveys the state of the art on the Hirsch Conjecture and its variants. Chapter [3] 
presents our new results on the geometric combinatorics of transportation polytopes. Finally, 
a summary of the computational results of the software package transportgen are presented 
in Appendix [A} 
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CHAPTER 1 

Polytopes, Polyhedra, and Applications to Optimization 

A polytope is the generalization of two-dimensional convex polygons and three-dimensional 
convex polyhedra in higher dimensions. Polytopes appear as the central objects of study 
in the area of geometric combinatorics, but they also appear in mathematical fields as 
diverse as algebraic geometry (see, e.g., [226J, [262J, and [269J), representation theory 
(see, e.g., [33], [125] . and [125] ) . lattice point enumeration (see, e.g., [25], [3D], [pT], [52] , 
and [289J), and optimization (see, e.g., [88], |144j . and [247] ). This dissertation discusses 
connections between optimization and the geometric combinatorics of polytopes. 

Before formally introducing polytopes, we discuss some concepts from affine linear 
algebra that will appear throughout this dissertation. Fix a positive integer c E Z>o- We 
will typically work in the c-dimensional Euclidean space M c . A set X C M c is convex if 
Ax + (1 — X)x' € X for all pairs (x,x') £ X x X and every A £ [0, 1]. The convex hull 
conv(A) of a set X C l c is the smallest convex set that contains X. That is to say, the 
convex hull of X C M c is defined as: 



The convex hull of a convex set is the set itself. (For more on convexity, refer to |24j, 
Chapter 1 in [210], or [286] .^ 

Now we give our first examples of convex sets. A set X C M c is called an affine sub space if 
it is a parallel translate of a linear subspace. The dimension dim(X) of an affine subspace X 
is the dimension of the parallel linear vector subspace. The zero-dimensional affine subspaces 
are the points and the one-dimensional affine subspaces are the lines. The affine subspaces 
of dimension c — 1 are called hyperplanes. (In other words, we say that hyperplanes are 
codimension one affine subspaces.) The affine hull aff (X) of a set X C M c is the smallest 
affine subspace that contains X. 




xcc 

C convex 



Formally, a poly tope P is the convex hull of a finite number of points in M c . (According 
to our definition, a polytope is always convex, so we do not distinguish between polytopes 
and convex polytopes.) The integer c G Z>o is called the ambient dimension of the polytope. 
This number may differ from the dimension of the polytope: the dimension dim(P) of a 
polytope P is the dimension dim(aff(P)) of its affine hull aff(P). We typically use d to 
denote the dimension of a polytope. A d-dimensional polytope is also called a d-polytope. 
When c = d, we say that the polytope P is full- dimensional. Otherwise, the number c — d > 
is the codimension of P. 

A polyhedron P is the intersection of finitely many closed half-spaces in M c . A polytope 
is also a polyhedron: more precisely, polytopes are those subsets of M c that are both bounded 
and polyhedra (see, e.g., [290]). The dimension, ambient dimension, and codimension of a 
polyhedron are defined in the same way as for a polytope. A ci-dimensional polyhedron is 
also called a d-polyhedron. 

Remark 1.0.1. We have given two equivalent definitions of a polytope. A polytope is 
either the convex hull of a finite set or the bounded intersection of a Bnite number of closed 
half-spaces. Though the definitions are equivalent (due to the Weyl-Minkowski Theorem - 
see, e.g., [290 J, from a computational point of view it makes a difference whether a certain 
polytope is represented as a convex hull or via linear inequalities: the size of one description 
cannot be bounded polynomially in the size of the other, if the dimension d is not fixed 
(see [13j ). For polynomial-time convex hull algorithms on a Bnite set of points in dimension 
two, see [89j or Chapter 33 of [73 1 . (For algorithms in higher dimensions, see, e.g., |68j . ) 

A hyperplane is the set of points x G M c satisfying a\X\ + ■ ■ ■ a c x c = m for some non-zero 
vector a = (ai, . . . , a c ) G M c and a real number m. (A linear hyperplane is a hyperplane that 
contains the origin, i.e., m = 0.) Since a hyperplane is the intersection of the two half-spaces 

{i G l c a\x\ + . . . a c x c < m} and {x G M c | a\X\ + . . . a c x c > m}, 

we can also say that a polyhedron is the intersection of a finite number (possibly zero) of 
linear equations and a finite number of linear inequalities. 

Remark 1.0.2. Though we have already given one definition for a hyperplane and two 
dehnitions for a polytope, it will be helpful to have in mind a second definition for a 



hyperplane and a third definition for a polytope. For this, we need to introduce afhne and 
convex combinations. 

Let finite collection of n points in M c . A point of the form 



(i.i) * = E A 



8=1 



where Ya=i ^ = ^ JS ca ^ e d an affine combination of the points xi, . . . , x n . The collection of 
points xi, . . . ,x n are said to be affinely independent if none of the points can be written as 
an affine combination of the remaining n — 1 points. A hyperplane is the set of all possible 
affine combinations of c affinely independent points in M c . More generally, an afhne subspace 
is the set of all possible affine combinations of n points in M c . 



n ■ 



An affine combination of the points x\, . . . ,x n in M c of the form ( 1.1 ) that meets the 
additional condition that each A« > is called a convex combination of the points 
A polytope is the set of all possible convex combinations of a hnite set of points. The collection 
of points xi,...,x n are said to be in convex position if none of the points can be written as a 
convex combination of the remaining n — l points. For example, for two distinct points x and 
y in M c , the interval between them is the set [x, y] = {z £ IR C | z = Xx + (1 — X)y, < A < 1} 
of all possible convex combinations. 



We will show in Section 1.3 (see Lemma 1.3.3) that polytopes and polyhedra in opti 



mization are typically presented in the form P = {x E M c | Ax = b and x > 0} where A is a 
matrix of size r x c and b is a vector in W . 

Definition 1.0.3. Let A be a matrix of size r x c and let b b a vector in R r . A polyhedron 
of the form 

(1.2) P = {x e R c | Ax = b and x > 0} = {x e M| | Ax = b} 

is called a partition polyhedron. The matrix A is called the constraint matrix of the partition 
polyhedron P. 



The name partition polyhedron is motivated by Lemma 1.0.12 and the discussion in 
Section 11.51 



Remark 1.0.4. We will define transportation polytopes in Section 1.4. but for now we 
simply remark that they are of this form. 

Example 1.0.5. Here are examples of polytopes and polyhedra. They will reappear as 
running examples throughout this chapter: 

• The d-simplex. The d-simplex generalizes the triangle in dimension two and the 
tetrahedron in dimension three. The standard d-dimensional simplex A^ is the 



convex hull of the standard unit vectors e%, . . . , e^+i in 



pd+l 



In this description, 



the ambient dimension of is c = d + 1. The d-simplex can also be dehned as a 



bounded partition polyhedron: Ad 
See Figure [7j| 



{x £ 



XI 



+ x d+1 = 1,0 < x < 1}. 





Figure 1.1. The standard 1-simplex, 2-simplex, and 3-simplex. 



The (i-cube. The ci-dimensional 0-1 cube is the convex hull of the 2 d points 
with coordinates or 1. In this description, the d-cube is full-dimensional, i.e., 
c = d. The d-cube can also be described as the intersection of 2d inequalities, i.e., 
{x E R d | < Xi < 1 Vi}. See Figure 



1.2 




Figure 1.2. The 0-1 cubes of dimensions one, two, and three. 



(1.3) 



The d-dimensional cross polytope. The ci-dimensional cross polytope Od is 
the convex hull of the d standard unit vectors e\ , . . . , and their negatives. For 



d = 3, this is an octahedron. See Figure 1.3 
Let P 3x3 = {x € R 9 | Ax = b, x > 0} where 



A 



1 1 1 

1 1 1 

1 1 1 

1 1 1 
1 1 1 



and 



2 
7 

2 
5 
5 



We will prove later that the polyhedron -P3X3 is bounded and is, therefore, a polytope. 
(In fact, we will later show that it is a 3 x 3 transportation polytope.) Since the 
5x9 matrix A has full row rank, the solution set of the matrix equation Ax = b 
is an afhne subspace of M 9 of dimension 9 — 5. Thus, P 3 x3 is a four-dimensional 
polytope dehned in a nine-dimensional ambient space. 




Figure 1.3. The cross polytopes of dimensions two and three. 

The main examples of unbounded polyhedra we will see are the polyhedral cones. A 
subset of M c that is closed under addition and under multiplication by non-negative scalars 
is called a cone. Equivalently, a cone is the intersection of linear half-spaces. (A linear 
half-space is a half-space defined by an equation of the form a\x\ + • • • + a c x c > 0, for some 
non-zero vector a £ M c .) For any set X C R c , the cone generated by X, denoted cone(X), is 
the set of all points that can be expressed as non-negative linear combinations of vectors 
in X. A cone generated by a finite set X is called a polyhedral cone. Equivalently, a cone 
is polyhedral if it is the intersection of finitely many closed linear half-spaces. (Since we 
are only interested in polyhedral cones, we will use the terms "cone" and "polyhedral cone" 



interchangeably. That is to say, all cones we discuss are finitely generated.) A d-dimensional 
cone is called simple if it is generated by a set X of cardinality d. 

Example 1.0.6. The orthant {x G R d \ Xi > for all i G [d]} is a simple d-dimensional 
cone. 

We now introduce the primary concepts that will appear throughout this dissertation. 
For further details, see [145j or [290 . 

We focus on particular subsets of a polytope (or a polyhedron) PCM C known as faces. 
A face F of P is obtained in the following way: let a G M c and m G K such that 

(1.4) (a, x) = ai^i + • • • + a c x c < m for all x = (x\, . . . ,x c ) G P. 



When this condition is satisfied, we say that the inequality (1.4) is valid and determines 



the face F = P n {x G M c | (a, x) = m}. Now, suppose that a G M c is non-zero. Then the 



inequality (1.4) defines a closed half-space H and the equation (a, x) = m is the boundary 
hyperplane of -ff , denoted by dH. The set F is non-empty only when dH intersects the 
boundary of P. In other words, a face is the intersection of P with a supporting hyperplane. 



See Figure 1.4 for an illustration. Since F is the solution set of linear equations and 



Figure 1.4. A polytope P and one of its non-empty faces. The supporting 
hyperplane that defines the highlighted face is shown. 

inequalities, this subset of P is a polyhedron as well. A face of dimension i is called an i-face. 
The faces of certain dimensions have special names. The 0-faces are called vertices of P, the 
1-faces are called edges, the (d — 2)-faces are called ridges, and the (d — l)-faces are called 
facets. What happens if we pick a G M c to be the zero vector? By choosing m G {0, 1}, we 



see that P itself and the empty set are both faces of P. These two faces are called the 
non-proper faces and the faces of intermediate dimension are called the proper faces. By 
convention, the empty face is said to have dimension —1. 

Remark 1.0.7. The definitions above are motivated by what faces of polytopes "look like." 
For cones and unbounded polyhedra, we typically make a distinction between the bounded 
and unbounded one-dimensional faces. The bounded faces of dimension one are called 
bounded edges while the unbounded faces of dimension one are called rays. (A cone may 
have at most one bounded non-empty face, namely the origin. The unbounded faces of a 
cone are themselves cones. If the origin is a vertex of a cone, then it is called pointed. The 
orthants are examples of pointed cones.) 

The relative interior of a set X C M c is a useful refinement of the concept of the interior 
of X. First, fix any metric on M c . (Though any metric works, we should have the Euclidean 
metric in mind.) Recall that the interior int(X) of a set X C M c is 

int(X) = {x e R c | 3 e > 0, N € (x) C X}, 

where N e (x) is the open ball of radius e centered at the point x. In particular, if the 
polyhedron P is not full-dimensional, then int(P) = 0. (Intuitively, this is the very thing we 
want to avoid as we try to discuss the "inside" of a polytope or polyhedron. For this, we 
define the relative interior.) The relative interior int*(X) of a set X is defined as its interior 
within its affine hull aff(X). That is, 

int*(X) = {x G R c | 3 e > 0, N e (x) n aff(X) C X}. 



See Figure 1.5 for an example. Note that dim(aff (int*(X))) = dim(aff(X)). In particular, 




Figure 1.5. A polygon P defined in a three-dimensional ambient space and 
a point x in the relative interior of P. The set N t (x) n aff (P) C P is depicted. 



the relative interior of a point x £ M c is itself. Unlike the interior of a polyhedron P, the 



relative interior of P remains invariant under embedding in higher-dimensional ambient 
spaces. The relative interior of a polyhedron P is the set of all points in P that do not 
belong to any proper face of P. The relative boundary of a set X is d*{X) = X \ int*(X), 
where X denotes the closure of X. Since polyhedra are closed, the relative boundary of a 
polytope or polyhedron is the union of all of its proper faces. To make these concepts clear, 
we state a fundamental decomposition theorem for polytopes and polyhedra. An illustration 
is given for a tetrahedron in Figure |1.6[ 

Proposition 1.0.8 (Polyhedron Decomposition Theorem). Let PCK C be a polyhedron. 
Then P is the disjoint union of the relative interiors of its faces. 



o 




Figure 1.6. A tetrahedron P decomposed into the relative interiors of its faces. 

Remark 1.0.9. In dehning a polytope P as the convex hull of a finite set X of points, one 
may give a description of P that is redundant if the set X includes points that are "on the 
inside of P." Specifically, if x G X is in the relative interior of the polytope P = conv(X) 
(or even stronger, if x G X is not a vertex of P), then P = conv(X) = conv(X \ {x}). In 
its irredundant description, a polytope is the convex hull of the set of its vertices. (See 
Figure [i~7| ) Similarly, in its irredundant description, a polyhedron is the intersection of its 
facet-defining closed half-spaces. A cone is minimally generated by its rays. 

We present a well-known characterization of vertices of a polytope or polyhedron (see 
Section 2.4 of [145] ). 

Lemma 1.0.10. Let P C M c be a d-dimensional polyhedron and suppose x G P. The 
following conditions are equivalent: 

(1) The point i£R c is a vertex of P. 




Figure 1.7. The same polygon, in irredundant and redundant descriptions. 



(2) For every non-zero vector y 6 M c , at most one of x + y or x — y belong to P. 



Proof. First, suppose that the point x £ P is a vertex of P. Suppose, for a contradiction, 
that there is a non-zero vector y £ M c such that both x + y and x — y belong to P. Then, 
the points x + y and x — y are distinct and the set L = [x — y, x + y] C M c defines a line 
segment. Moreover, since P is convex, it contains L. Thus, any supporting hyperplane of P 
that contains x also must contain L. But this means that any face of P that contains x also 
contains the line segment L, and therefore the singleton set {x} cannot be a face of P, a 
contradiction. 



Conversely, suppose that x £ P is not a vertex of P. By Proposition 1.0.8, the point x 
is in the relative interior of some i-face F of P, with i > 1. Clearly, x + y and x — y both 
belong to F for any sufficiently small vector y £ M c parallel to the affine hull of F. □ 



Example 1.0.11. Let us revisit some of the polyhedra of Example 1.0.5 



The d-simplex has d + 1 vertices and d + 1 facets. Every pair of vertices is 
contained in an edge. 

The vertices of the d-cube are the 2 d points with coordinates or 1. Two 
vertices x and x' of are contained in the same edge exactly when all but one 
of the coordinates of x and x' are the same. The facets of Od are given by the 2d 
inequalities < Xi < 1. 



Let us verify Lemma 1.0.10 for some points on the 2>-cube LI3. The point 
x = (0,jj|,0) lies on an edge of LI3. A vector such as y = (0, ^,0) shows that 



condition (2) of the lemma holds. See Figure 1.8 For a vertex, the condition is 



easiest to see at (0,0,0), where one can easily check that for non-zero vectors y 
belonging to P, the vector —y is not in P. 




Figure 1.8. The point x = (0, 2,0) is not a vertex of the 3-cube LI3. From 
x, one can move in the direction y = (0, |, 0) and in the opposite direction 
-y = (0,-1,0). 

• The d-dimensional cross polytope 0d has 2d vertices: the d standard unit vectors 
and their negatives. It has 2 d facets, one in each orthant ofM. d . 

• Using the software polymake (see [136J) available at |135| . one can compute the 
twelve vertices of the polytope P3x3- They are: 

zi = (2, 2, 1, 0, 5, 0, 0, 0, 1), z 2 = (2, 1, 2, 0, 5, 0, 0, 1, 0), z 3 = (1, 2, 2, 0, 5, 0, 1, 0, 0), 
z 4 = (1, 4, 0, 0, 3, 2, 1, 0, 0), z 5 = (2, 3, 0, 0, 3, 2, 0, 1, 0), z 6 = (2, 3, 0, 0, 4, 1, 0, 0, 1), 
z 7 = (0, 4, 1, 2, 3, 0, 0, 0, 1), z 8 = (0, 3, 2, 2, 3, 0, 0, 1, 0), z 9 = (0, 3, 2, 1, 4, 0, 1, 0, 0), 
z w = (0, 5, 0, 2, 2, 1, 0, 0, 1), z u = (0, 5, 0, 2, 1, 2, 0, 1, 0),z w = (0, 5, 0, 1, 2, 2, 1, 0, 0). 

If P is presented as a partition polyhedron of the form ( |1.2[ ), then there is another 
characterization for the vertices of P, which we describe now. If P = {x 6 M c | Ax = b, x > 0} 
is non-empty, then we can assume that the matrix A has full rank. Indeed, if A is not full 
rank, then one or more linear equations in Ax = b is redundant. So, let A be an r x c 
matrix of full row rank and let b € W. Let P = {1 £ l c Ax = b, x > 0}. With a slight 
abuse of notation, by cone(^4) we mean the cone generated by the set of column vectors of 
the matrix A. A maximal linearly independent subset A of columns of A is a basis of A. 
Geometrically, each basis A of the matrix A spans a simple cone inside cone(A). (Recall 
that a d- dimensional cone is simple if it has exactly d rays.) Every basis A of A defines a 
basic solution of the system as the unique solution of the r linearly independent equations 
Axa = b and Xj = for j not in A. A basic solution is feasible if, in addition, x > 0. 
Geometrically, a basic feasible solution corresponds to a simple cone that contains the vector 



11 



b. In fact, one can see that the polytope P is non-empty if and only if b G cone(A). A 
fundamental fact in linear programming is that, for a given vector b G M r , all vertices of the 
polyhedron Pf, are basic feasible solutions (see, e.g., |247| or [288J). So, we have proved the 
following characterization for vertices of partition polyhedra: 

Lemma 1.0.12. Fix an r x c real matrix A with full row rank r. Fix a vector b G IR r . Let 
P be the partition polyhedron P = {x G M c | Ax = b, x > 0}. Then x is a vertex of the 
polyhedron P if and only if x is a basic feasible solution. 

Polytopes and polyhedra have a finite number of vertices: 

Proposition 1.0.13. Let P be a d-dimensional polyhedron. Then, P has a finite number 
of vertices. 

Proof. Clearly, a <i-polytope P has a finite number of vertices. A ci-polyhedron P 
defined by n facets also has a finite number of vertices: since at most one vertex is found 
at the intersection of d + 1 facets, the number of vertices is trivially bounded above by 




Let vert(P) denote the set of vertices of P. We denote its cardinality by /o = | vert(P)|. 
Polytopes and polyhedra have finitely many faces: 

Proposition 1.0.14. Let P be a d-dimensional polyhedron. Then, P has a finite number 
of i-dimensional faces, for i G {— 1, . . . , d}. 

Proof. Let /o G Z>o denote the number of vertices of P. Each i-face of P contains 
i + 1 affinely independent vertices of P, and different faces of P have different affine hulls. 
Thus, the number of different i-faces of P is bounded above by (J^i) • □ 

Let ft = fi(P) G Z>o denote the number of z-dimensional faces of P. The f -vector 
of P is f(P) = /o, f x ,..., f d -i, f d ) G Z d + 2 . Note that /_x = f d = 1, but there is no 
complete characterization for the values of the other entries: the underlying interaction 
between combinatorics and algebraic geometry led to a complete characterization of the 
possible numbers of faces that a simplicial polytope can have (see [261] ) . The same question 
for arbitrary polytopes is open in dimension four and higher (see |291j ). The /-vector of 
every polytope satisfies the following well-known relation (see, e.g., [154J ) : 



1.1. GRAPHS OF POLYTOPES AND POLYHEDRA 



12 



Proposition 1.0.15 (Euler-Poincare Relation). Let P be a polytope and let 

f(P) = (/-lj /o, fi, • ■ ■ i fd-l, fd) 
be its f -vector. Then, the terms of the f -vector satisfy 

d 

E ("^ = -/-l + /o - /i + • • • + = 0. 

i=-l 

In the terminology of [154] . the reduced Euler characteristic of a polytope is zero, since 
polytopes are contractible. 

1.1. Graphs of polytopes and polyhedra 

We now introduce the graph of a polytope. Let P be a polytope. The graph (or 1-skeleton) 
of P, denoted by G(P), is the following undirected, finite, simple graph: 

• Vertices of G(P). The graph G(P) has a vertex for every vertex v of the polytope 
P. Let us denote the vertex of the graph G(P) corresponding to the vertex v of 
the polytope P by G{v). 

• Edges of G{P). Two vertices G(v) and G(v') in G(P) are connected by an edge 
in G(P) if there is an edge of the polytope P containing the corresponding vertices 
v and v' of P. When this occurs, the vertices v and v' of the polytope P are said 
to be neighbors. 



Figure 1.9 gives an intuitive example of the graph of a polytope. Let us review some basic 
terminology from graph theory. For more information, refer to [45], |46j . or [278] , The 
distance between two vertices in a graph is the minimum number of edges needed to go from 
one vertex to the other vertex. The diameter of a graph is the maximum distance between 
all pairs of vertices. The number diam(G(P)) 6 Z>o is the diameter of the graph of P. This 
number is also called the diameter of the polytope P. 

Remark 1.1.1. For an unbounded polyhedron P, the graph G(P) of P is defined in the 
same way, but G(P) contains only the bounded edges. The graph G(P) does not include 
the rays (the unbounded one-dimensional faces). 



Example 1.1.2. Let us revisit some of the polytopes and polyhedra of Example 1.0.5 



1.1. GRAPHS OF POLYTOPES AND POLYHEDRA 
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The graph of the d-simplex is the complete graph K^+i so its diameter is one. 
The graph of the 0-1 d-cube has diameter d: the steps needed to go from a 
vertex to another equals the number of coordinates in which the two vertices differ. 
The graph of the d-dimensional cross polytope 0d is almost complete: the only 
edges missing from G(0d) are those between opposite vertices. Thus, the diameter 
of a cross polytope is two. 

Using the software polymake (see [TB6J ), one can compute the graph 6*^3x3) of 



the polytope Ps X 3- The graph is shown in Figure 1.10 By inspection, the diameter 
of the graph is three. 

A graph is d-connected if the graph that remains after removing any d — 1 vertices is 
still connected. In |17j . Balinski proved that the graph of every ci-polytope is (i-connected: 

Theorem 1.1.3 (Balinski's Theorem, [T7|). Let P be a d-polytope. Then the graph G{P) 
is at least d-connected. 

A classic theorem of Steinitz characterizes the graphs of 3-dimensional polytopes. Chap- 
ter 4 in [290J contains a proof of it. No analogous theorem is known for 4-polytopes (see, 
e.g., [291]). 

Theorem 1.1.4 (Steinitz' Theorem, [2651 1266] ). Let G be a simple graph. Then G is the 
graph of a 3-polytope if and only if G is planar and 3-connected. 



1.2. GEOMETRIC COMBINATORICS OF POLYTOPES AND POLYHEDRA 
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Figure 1.10. The graph G(P 3x3 ) of the 4-polytope ^3x3- 



1.2. Geometric combinatorics of polytopes and polyhedra 

We turn now to the combinatorics of the faces of a polytope. The faces of a polytope P 
form a poset (or partially-ordered set). For more on posets and lattices, see Chapter 3 
in |263j . We describe just what is needed here: for a more complete discussion on the 
geometric combinatorics of polytopes and their faces, see Section 2.2 in |290| . The collection 
of all faces of a polytope P (including the empty face and the face P itself, which are the 
elements and 1, respectively) is a poset where order relation is given by inclusion. This 
poset is a lattice, called the face lattice of P, and denoted by L(P). The lattice L(P) is 
graded by the face dimension. A cover relation exists for two elements of L(P) whenever an 
(i + l)-dimensional face contains an z-dimensional face. We often represent the face lattice 
using a Hasse diagram, which has a node for each element of the poset and a vertical edge 
for each cover relation. 



Example 1.2.1. Figures 1.1 1\ and 1.12 (respectively) show examples of Hasse diagrams 



for the d-simplex with d = 2,3 (respectively). Figures 1.13 and 1.14 (respectively) show 



examples of Hasse diagrams for the d-cube with d = 2, 3 (respectively). Figures 1.15 and 1.16 
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(respectively) show examples of Hasse diagrams for the d-dimensional cross polytope with 

d = 2, 3 (respectively). 

Figure \1 . 1 7| shows the Hasse diagram of the four-dimensional polytope ^3x3- 

The largest element 1 in each poset is the polytope P itself, and the smallest element 

is the empty face 0. (These Hasse diagrams are traced copies of the output from the 

VISUAL_FACE_LATTICE command in polymake. See [T36] . ) 




Figure 1.11. Hasse diagram of the 2-simplex. 




Figure 1.12. Hasse diagram of the 3-simplex. 

We say that two polytopes P and P' are combinatorially equivalent if they have isomorphic 
face lattices, i.e., L(P) = L(P'). 
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Figure 1.13. Hasse diagram of the 2-cube. 




Figure 1.14. Hasse diagram of the 3-cube. 

Remark 1.2.2. Terminology for some basic polytopes is often used loosely. For example, 
any polytope that is combinatorially equivalent to the standard simplex is called a 
simplex. (In other words, the convex hull of any d + 1 afRnely independent points in M c is 
called a d-simplex.) Similarly, a cube is any polytope that is combinatorially equivalent to 
the standard 0-1 cube. Any polytope that is combinatorially equivalent to a cone is called a 
cone, or sometimes an afhne cone. 

Given a poset L, we define the opposite poset L A which has the same elements but the 
order relation is reversed. The Hasse diagram of the poset L A is a vertical reflection of the 
Hasse diagram of the poset L. 
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Figure 1.15. Hasse diagram of the 2-dimensional cross polytope. 




Figure 1.16. Hasse diagram of the 3-dimensional cross polytope. 

Example 1.2.3. Hasse diagrams of simplices are invariant under vertical reflection, so the 
face lattice of any d-simplex is its own opposite. The face lattices of the d-cube and the 
d-dimensional cross polytope are opposites of each other. 

Many of the results that we will survey in Chapter [2] are more natural in the polar 
setting. We now describe the polar of a polytope. This is the notion often called the dual, 
but we adopt the use of the term polar to distinguish polarity from duality in the sense of 
linear programming. (For more on polarity, see Chapter 7 in [286J or Section 2.3 in |290j.) 

We introduce the polar graph (or dual graph) of a polytope P. The polar graph of a 
polytope P, denoted by G A (P), is the following undirected, finite, simple graph: 
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Figure 1.17. Hasse diag ram of the polytope -P3x3- 



• Vertices of G (P). The graph G (P) has a vertex for every facet / of the 
polytope P. Let G A (f) denote the vertex of the graph G (P) corresponding to 
the facet /. 

• Edges of G A (P). Two vertices G A (f) and G A (f) of G A (P) are connected by 
an edge in G A (P) exactly when their corresponding facets / and /' intersect in a 
ridge of P. (That is to say, the face / n /' is a ridge of P.) When this occurs, we 
say the facets / and /' are neighbors. 

The definition of the polar graph is motivated by the polar of a set. The polar of a set 
PCi c , denoted by P A is the set 

(1.5) P A = {y e M c | (x, y) < 1 for all x € P}. 

In the definition of the polar of a set P C M c , it will be convenient to assume that P is 
full-dimensional, and that the origin is in the interior of P, which can always be assumed by 
a suitable translation. The polar of an arbitrary set P C M c is a convex set. When P is a 
polytope, then the polar P A of P has some nice properties: 
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Lemma 1.2.4. Let P C M c be a polytope such that the origin is an interior point of P. 
Then, 

(1) The set P A is a polytope. 

(2) Polarization is an involution: P AA = P. 

(3) The lattices L(P A ) and L A (P) are isomorphic. 

(4) The graphs G(P A ) and G A (P) are isomorphic. 

is proved in Section 2.3 of [290J. If P is a polytope, then P A is called 



Lemma 



1.2.4 



the polar polytope. Any polytope Q that is combinatorially equivalent to P A is called a 
combinatorial polar of P. 

Example 1.2.5. Any d-dimensional cube and any d-dimensional cross polytope are combi- 
natorial polars of each other. Every polygon is polar to itself. Simplices in any dimension 
are also self-polar. 

Part <j3T) of Lemma 1.2.4 says that the face lattices of a polytope P and its polar P A are 



opposites. The facets (respectively vertices) of P A correspond to the vertices (respectively 
facets) of P. More generally, every (d — i)-face of P A corresponds to a face of P of dimension 
i — 1, and the incidence relations are reversed. Part Q of Lemma 1.2.4 says that the graph 
G(P A ) of the polar P A of a polytope P is isomorphic to the polar graph G A (P) of P. In 
particular, this means: 

Remark 1.2.6. Via polarity, studying the graphs of polytopes is equivalent to studying the 
polar graphs of polytopes. 

Of special importance are the simple and simplicial polytopes. A (i-polytope or d- 
polyhedron is called simple if every vertex is the intersection of exactly d facets. Equivalently, 
a d-polyhedron is simple if every vertex in the graph G{P) has degree exactly d. (Thus, the 
graph G(P) of a simple <i-polyhedron is <i-regular.) 

Example 1.2.7. Simplices and cubes are simple. Cross polytopes are not simple starting in 



dimension three. The four-dimensional polytope P3X3 defined in Example 1.0.5 is simple 



since every vertex in its graph (see Figure 1.10 on page 14) has exactly four neighbors 



A (i-dimensional polytope or polyhedron P is simplicial if every facet of P is a (d — 1)- 
simplex. Cross polytopes are simplicial polytopes. It follows from Part Q of Lemma 1.2.4 
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that the polar of a simple polytope is a simplicial polytope. In fact, the notions of simple 
and simplicial are polar to each other in the following way: the polytope P is simplicial if 
and only if the polytope P A is simple. For example, the <i-dimensional cross polytope is 
the polar of the d-cube. Since cubes are simple polytopes, cross polytopes are simplicial. 
The polar of a simplex is a simplex. Among polytopes of dimension three and higher, the 
simplices are the only polytopes which are at the same time simple and simplicial (see 
Exercise 0.1 in [290] ). 

Since the facets of simplicial polytopes are simplices, this in turn implies that all proper 
faces of a simplicial polytope are simplices. This is nice because then we can forget the 
geometry of a simplicial (i-polytope and look only at the combinatorics of the simplicial 
complex formed by its faces. This simplicial complex is a topological (d — l)-sphere. (For 
more about simplicial complexes see, e.g., |154| or [216J.) 

In Chapter [2j we will want to consider only the graphs of simple polytopes. By 



Remark 1.2.6, for simplicial polytopes, we are interested in the polar graph G A (P) of a 



polytope P, defined in Section |1.1[ 

1.3. Polytopes and optimization 

Linear programming problems are the first class of problems discussed in mathematical 
optimization. In this section, we describe the connections of linear and integer programming 
to polytopes and polyhedra. We also give an overview of how to solve a linear program. 

In linear programming, one is given a system of linear equalities and inequalities, and the 
goal is to maximize or minimize a given linear functional. We first describe linear programs 
in standard form. Fix a real-valued r x c matrix A, a vector b G R r , and a linear functional 
£ : R c — > R. In its standard form, a linear program is given by A, b, £ is the following 
problem: 

(1.6) Maximize £(x), subject to Ax = b and x > 0. 

The linear functional £ is called the objective function or the cost function. (In linear 
programming, it is no different to minimize or maximize: indeed, minimizing £ is the same 
as maximizing — £.) The equations Ax = b and inequalities x > are the constraints. The 
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coordinates of x = {x%, . . . , x c ) are called the decision variables. Suppose the r x c matrix A 
has rank r, with r < c, and let d = c — r. Then, the equality Ax = b defines a d-dimensional 
affine subspace whose intersection with the linear inequalities x > gives the feasibility 
polyhedron 

P = {x € R c | Ax = b and x > 0}. 

Note that the resulting polyhedron is a partition polyhedron. If the feasibility polyhedron 
P is bounded, then it is called the feasibility polytope. If the polyhedron P is non-empty, 
then the linear program is called feasible. A vector x belonging to the feasibility polyhedron 
P is called a feasible solution. A feasible solution x that maximizes the linear functional 
£ is called an optimal solution. Optimal solutions are typically denoted by x* . Typically, 
it is not enough to simply say that a linear program is feasible, and simply conclude that 
there is an optimal solution somewhere. Instead, we must actually find it! We typically 
want to know the actual coordinates of an optimal solution x* , and not just the maximal 
value G K alone. 

Remark 1.3.1. If the feasibility polyhedron P is feasible and unbounded, then, depending on 
the objective function £, we may run into the "danger" that there are no optimal solutions! 
(What could go "wrong"? The value of £(x) may be arbitrarily large for feasible vectors x 



in P.) See Figure 1.18 for an example. In this case, one desires the coordinates of a feasible 
solution x and a direction vector y £ M c such that 

(1) vectors of the form x + Xy belong to the polyhedron P for all real A > 0, and 

(2) the value of £(x + Xy) goes to infinity as X — > oo. 




Figure 1.18. An unbounded linear program: the arrow shows the direction 
in which £ increases. 



When this occurs, we say the linear program is unbounded with respect to £. When there is 
an optimal solution, we say the linear program is bounded with respect to £, even if the 
polyhedron P is unbounded. See Figure 1.19 
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Figure 1.19. A bounded linear program with respect to £. The feasibility 
polyhedron P is unbounded. 

The feasibility polyhedron P is convex and the level sets of the objective function £ 
are hyperplanes. It follows that an optimal solution of a linear program, if one exists, is 
found among the vertices of P. In fact, since the level sets of the linear functional £ are 
hyperplanes, the set of optimal solutions is a face of P, which is clear by the definition 
of a face. In particular, an optimal solution is found among the vertices of the feasibility 
polyhedron P of a bounded linear program. If the objective function £ is sufficiently generic, 
and if the linear program is bounded with respect to £, then the linear program has a unique 
optimal solution x*, and the solution x* is a vertex of P. 

Since optimal solutions of a linear program are found among the vertices of its feasibility 
polyhedron, and since the number of vertices in a polyhedron is finite, this leads to a natural 
first algorithm to solve a linear program. First, compute all of the vertices of the feasibility 
polyhedron. Then output a vertex whose £- value is largest. Unfortunately, this algorithm is 
not very practical. As the dimension of the feasibility polyhedron grows, there are simply 
way too many vertices to compute. In fact, there is even a more fundamental flaw. How do 
you even find one vertex of the feasibility polyhedron? Even in two dimensions, it is not 
obvious how to take a given set of linear inequalities and even find a solution. Consider the 
following example, which we will use as a running example. 

Example 1.3.2 (A sample application of linear programming). Maxwell is opening a new 
restaurant. He needs to decide how much should be spent at the grocery store each month 
and how many hours per month to schedule employees to maximize the restaurant's profit. 
Let x\ denote the amount to spend at the grocery store each month and let X2 denote the 
number of labor hours per month. Suppose that the restaurant's profit is determined by the 
function 

£(xi,x 2 ) = 19xi - Tx 2 . 
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Maxwell wants to know which pair x = (x±,X2) maximizes the value £(x) of the objective 
linear functional £, but there are some restrictions. Clearly, x\ > and X2 > 0. There are 
more constraints Maxwell must obey. Suppose, for example, that labor laws, union rules, 
and other factors further restrict the choice of x = (x\, X2) to: 
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In dimension two, it is easy enough to carefully graph the half-spaces, then compute the 
value of £ on each of the vertices. But in general, one cannot even u graph" the feasibility 
polyhedron P. How do you even find one feasible point x £ P? The first natural idea is to 
travel along the (for some i) until you hit the boundary of P. 

Even in this small example, that idea would fail. If it were not for the two inequalities 
xi,X2 > 0, how would we even know which direction to travel on the axes? (In fact, for this 
example, these two inequalities are redundant to the description of P.) A search along the 
axes would definitely fail in our example, since P lies completely in the relative interior of 
an orthant: the feasibility polyhedron does not even intersect the set {x = (xi, X2) G M 2 | 
X1X2 = 0}. Even assuming that P is in the relative interior of an orthant, in a c-dimensional 
ambient space, there are 2 C orthants! In addition, the polyhedron P, if it is non-empty, 
may be "far away from the origin," and if you try to do a search along a path that is 
piece-wise linear, how can you know how far to travel along a direction before turning in a 
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new direction? (In fact, how do you know whether the current direction of travel in your 
path moves you towards the feasibility polyhedron P, or away from it? Even worse, what if 
you could never find P because the feasibility polyhedron in empty? How would you even 
be able to detect this case?) 

To solve a linear program, we first describe how to convert any linear program into one 
whose constraints are of the form Ax = b, x > 0. (See |72] or [209J.) 

(1) Non- negativity. If any decision variable Xi does not have the constraint X, > 0, 
then we do a variable substitution. We will replace Xi by two new non-negative 
decision variables x\ > and x" > 0. Replace every occurrence of Xj by x\ — x". 
(The new decision variables x\ and x'( are called auxiliary variables. The modified 
linear program no longer mentions the old decision variable X,.) 

(2) Linear equations. Any linear equality constraints will simply be part of the 
matrix equation Ax = b, so these should not be modified (except for any variable 
substitutions from the previous step). 

(3) Linear inequalities. Turn each linear inequality into a linear equation by adding 
an auxiliary variable called a slack variable. The linear inequality a\X\ + • • • + 
a c x c < bo becomes a\X\ + • • • + a c x c + z = bo, with z > 0. The linear inequality 
a\x\ + • • • + a c x c > bo becomes a\X\ + • • • + a c x c — z = bo, with z > 0. 

Putting this all together proves the following fact, which says that any polyhedron can be 
written as a partition polyhedron. 

Lemma 1.3.3. Let Pci c be any polyhedron. Then there is a polyhedron 

P = {x e R z | Ax = b, x > 0} 
and a map ir : M c — > M c of the form 

X l^Xl , . . . , Xp, X^, . . . , Xg , X-y , . . . , Xq ) I r \X\ , . . . , Xp, X-y Xy , . . . , Xg Xg ) 

with p > p, such that the restriction Tr\p is a bijection from P to P. The polyhedra P and 
P are combinatorially equivalent. 
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Example 1.3.4. Let us convert the linear program of Maxwell's restaurant from Exam- 
ple 1.3.2\ to standard form. Both decision variables are already non-negative, so there is 



nothing to do in the first step above. (In the notation of Lemma 1.3.3, q = O.J 

We convert the ten non-trivial inequality constraints. The result is the new system 
Ax = b and x > 0, where 
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The linear program has been converted. Now, we want to maximize the objective function 
= • • • , X12) = 19xi — 7x2 over the feasibility polyhedron 
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and 



P = {x£ 



,12 



Ax = b, x > 0}. 



The feasibility polyhedron P is a partition polyhedron. 

Let A be a real- valued matrix of size r x c and let b be a vector in W . We now describe 
how to find a feasible solution x to a linear program in the standard form 



(1.7) 



Ax = b, x > 0, 



if one exists. (See [209] for more details.) This method will show that feasibility of one 
linear program is reduced to optimality of another linear program. (See [48J and [241 for a 
detailed explanation.) 

To begin, we assume, without loss of generality, that the vector b is in M> . Indeed, if 
any coordinate of the vector b 6 W is negative, we multiply it (and the corresponding row 
of the matrix A) by —1. From this set of r linear equality constraints, we construct a new 
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linear program. Add r non-negative auxiliary decision variables, say, z±, . . . ,z r > 0. For 
each i = 1, . . . , r, modify the ith constraint from 

a^ia^i H h a iiC x iyC = h > 

to 

^z, l^i, 1 "I - ' ' ' "I - ^i ; C-^2,C ~~\~ %i — ^2* 

In terms of matrices, the new linear system has the constraint matrix A' = [A \ I] , where / is 
the r x r identity matrix. There is a very easy initial feasible point for this modified system, 
namely {x\, . . . , x c , z\, . . . , z r ) = (0, . . . , 0, &i, . . . , b r ) 6 M>"g r . For this system, we use any 
algorithm for linear programming to minimize the objective function z) = z\ + • • • + z r 
subject to the above constraints. If the solution gives a point (x, z) € MS^ r with z) = 0, 
then the coordinate-erasing projection of (x, z) to x G M c is an initial feasible solution of the 



original linear program with the constraints (1.7). If the optimal solution (x,z) has strictly 



positive £ value £(x,z) > 0, then the original problem (1.7) has no feasible solution. 

In linear programming, one assumes that the decision variables are always real-valued 
quantities. This restriction is too strong in general. After all, one cannot hire half of an 
employee! In many settings, it is more natural to consider the situation where the decision 
variables must be integers. An integer program is a linear program with the additional 
constraint that the vector x of the decision variables has all integer coordinates. An integer 
program is considered solved if, given a linear functional £ : M c — > M. and a polyhedron 
P, one knows which integral lattice point x in P n Z c has the largest value of £. We do 
not discuss integer programming in any detail in this dissertation. (For more on integer 
programming see, e.g., [247J.) 



1.4. Transportation polytopes 

Transportation polytopes are well-known objects in operations research, mathematical 
programming, and statistics. In statistics, transportation polytopes are known as contingency 
tables. Surveys on the research in transportation polytopes and the transportation problem 
are found in the book by Yemelichev, Kovalev, and Kratsov (see [288]), in Vlach's survey 
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(sec |281j), in Klee and Witzgall's article (see |184j ). and in the recent survey of De Loera 
and Onn (see [99 ). 

1.4.1. Classical transportation polytopes 

We begin by introducing the most well-known subfamily of transportation polytopes. Fix 
two integers p,q G Z>o- The classical transportation polytope P of size p x q defined by 
the vectors u G W and v G is the polytope defined in the pq variables Xij G M>o 
(* £ \p\i3 ^ [q]) satisfying the p + q equations 

q p 

(1.8) J2 x hj =Ui (i£ [p]) and J^a*j = v i U G [?])• 

3=1 i=l 



Since P is defined by the p + g linear equations in (1.8) and the pq linear inequalities 



%ii > 0, it is a polyhedron. Since the coordinates x% j of P are non-negative, the summation 



conditions (1.8) imply that P is bounded, so classical transportation polytopes are polytopes. 



(In fact, < Xij < m.in{ui,Vj} for all i £ [p],j E [q].) After re-indexing the variables 



(* ^ \p\i3 e H) as ^1)^2, • • • , ^p9, the equations (1.8) and the inequalities > can be 
rewritten in the form 

P = {x G W q \ Ax = b,x > 0} 
with an appropriate 0-1 matrix A of size (p + q) x and a vector b G M p+9 . Thus, every 



classical transportation polytope is presented in the form (1.2) and is, thus, a partition 
polyhedron. The matrix A does not have full row rank. Indeed, the sum of the rows 
corresponding to the u-sum equations is the same as the sum of the rows for the f-sum 
equations. Since this is the only linear dependence among the rows of the matrix A, the rank 
of A isp+q— 1. Therefore the dimension d of the affine hull of P is pq—(p+q—l) = (p—l)(q—l). 
Therefore, 

Corollary 1.4.1. Every non-empty p x q classical transportation polytope has dimension 
d = pq— p — q + 1 and ambient dimension c = pq. 



Example 1.4.2. Let us reconsider the polytope Pzx3 from Example 1.0.5 Every point 
x £ P3x3 satisfies the equation x? + x§ + xg = 1, so let us add in this redundant equation. 
We give an equivalent definition to our earlier P3X3, defining it now as P3X3 = {x G M 9 | 
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^-3x3^ = b, x > 0}, where 



A, 



3x3 



1 1 1 

1 1 1 

1 1 1 

1 1 1 
1 1 1 
1 1 1 



and 



2 
7 
2 
5 
5 
1 



Up to permutation of rows and columns, the matrix A^ X 3 is the unique constraint matrix 
for 3x3 classical transportation polytopes. It is a 6 x 9 matrix of rank Eve. Thus, -P3X3 
is a four-dimensional polytope described in a nine-dimensional ambient space. By identi- 
fying the variables xi,...,xq (respectively) with the variables £1,1, xi^, £1,3, £2,1, • ■ • > x 3,3 
(respectively), the polyhedron -P3X3 is a 3 x 3 classical transportation polytope defined by 
the vectors v = (2, 7, 2) T and u = (5, 5, 1) T . 
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Figure 1.20. The vertex z\ after reindexing. 



The notation Xij is suggestive. We think of a point x = (^ij)ie]p],je[q\ e P — ^- pxq as a 



p x q table. For example, the vertex z\ = (2, 2, 1, 0, 5, 0, 0, 0, 1) defined in Example 1.0.11 



under the identification, is shown in Figure 1.20 In terms of tables, the equations in (1.8) 



are conditions on the row sums and column sums of tables that correspond to feasible points 
in P. 

The matrix A is called the defining matrix (or the constraint matrix) of p x q classical 
transportation polytopes. The vectors u and v are called marginals. For P to be non-empty, 
the vectors u and v should be non-negative. (The case when a coordinate Ui or Vj is zero is 
uninteresting, so we usually assume that u G M^g and v G K>q-) These polytopes are called 
transportation polytopes because of the following scenario: consider a model of transporting 
goods with p supply locations (with the ith location supplying a quantity of Uj), and q 
demand locations (with the jth location demanding a quantity of Vj). The feasible points 
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x = (^ij)ie[p],ie[gl m a P x 1 transportation polytope P model the scenario where a quantity 
of X{ j of goods is transported from the ith supply location to the jth demand location. 

Definition 1.4.3. If P is a non-empty p x q classical transportation polytope and x = 
( x i,j)iE\p],je[q\ JS m P> then X{j > for all i G [p] and j E [(/]. The pairs G [p] x [5] 
where acfj is strictly positive are called support entries. For a point x E P, we define the 
support set supp(x) to be E \p] x [g] | Xij > 0}. 

A necessary and sufficient condition for a classical transportation polytope to be non- 
empty is the sum of the supply margins equal the sum of the demand margins: 

Lemma 1.4.4. Let P he the p x q classical transportation polytope dehned hy the marginals 
u G M> and v £ K> - The polytope P is non-empty if and only if 

(1.10) E n > = E^ 

»e[p] 

The proof of this lemma uses the well-known northwest corner rule algorithm (see 
survey |238j or Exercise 17 in Chapter 6 of [288J). 



Proof. To show necessity, suppose X^iefol Ui ^ Yljelq] v r ^ substituting (1.8), the left 



and right sides of the equation (1.10) are not equal. Thus, the linear system is inconsistent 



and there is no solution x satisfying (1.8) 



For the converse, we construct a point x G P using the northwest corner rule algorithm: 
let x Pt q = mm{up,v q }. If the minimum is obtained at u p , set x p j = for all j 7^ q and 
replace v q with v q — u p . The rest of the point x = (xij) is obtained recursively as a point in 
a (p — 1) x q classical transportation polytope. Similarly, if the minimum is obtained at v q , 
set Xi t q = for all i / p and replace u p with u p — v q . The rest of the point x is obtained as 
a point in a p x (q — 1) transportation polytope. If the minimum was obtained at u p = v q , 
then the rest of the point x is obtained as a point in a (p — 1) x (q — 1) transportation 
polytope. □ 

Definition 1.4.5. A p x q classical transportation polytope P is generic if 

(1.11) E u ^E^- 
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for every non-empty proper subset Y C [p] and non-empty proper subset Z C [q] . (Of course, 



due to (1.10), we must disallow the case where Y = [p] and Z = [q].) 



Remark 1.4.6. Very soon we will introduce the notion of non-degenerate transportation 
polytopes. We will see in Lemma 1.4.12\ that the notions of genericity and non-degeneracy 



coincide for classical transportation polytopes. (But the condition defined above, which 
we need now, has no generalization to most multi-way transportation polytopes, which we 
introduce in the next section.) 

If P is a generic p x q classical transportation polytope P, then in the northwest 
corner rule algorithm used in the proof of Lemma |1.4.4[ the minimum is never attained 
simultaneously at u p and v q . This proves: 

Corollary 1.4.7. Let P / be a generic p x q classical transportation polytope. Then, 
there is a point x G P with | supp(x)| = p + q — 1. 

Let P be a p x q classical transportation polytope. To every point x £ P, we define 
a bipartite graph B(x), called the support graph of x. The graph B(x) is the following 
subgraph of the complete bipartite graph K Ptq : 

• Vertices of B(x). The vertices of the graph B(x) are the vertices of the complete 
bipartite graph K p>q . That is, the graph B(x) has p vertices of the first kind and q 
vertices of the second kind. 

• Edges of B(x). An edge (i, j) connecting vertex i of the first kind to vertex j of 
the second kind exists if and only if X{ j is strictly positive. That is to say, the set 
of edges is in one-to-one correspondence with the support set supp(x) defined in 
Definiton 1.4.3 The value of Xij > is called the weight or the flow of the edge 



Example 1.4.8. Let us consider the point z\ G P3X3 from Example 1.4.2 under the reindexing. 



Here, supp(zi) = {(1, 1), (1,2), (1,3), (2,2), (3,3)}. Figure \1.21\ depicts the graph B( Zl ). 

The graph properties of B(x) provide a useful combinatorial characterization of the vertices 
of classical transportation polytopes: 

Lemma 1.4.9. Let P be a p x q classical transportation polytope defined by the marginals 
u G M> and v G K> > an< ^ ^ x ^ P- Then the graph B(x) is spanning. The feasible point 
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Figure 1.21. The support graph B{z\) of the vertex z\ G P3x3- The nodes 
of B(z\) on the left correspond to the p = 3 supplies. The nodes on the right 
correspond to the q = 3 demands. 



x is a vertex of P if and only if B{x) is a spanning forest. Moreover, if P is generic, then x 
is a vertex of P if and only if B(x) is a spanning tree. 

Proof. Let x G P. The marginals are strictly positive. Since x is feasible, for each 
i G \p], there is a j G [q] such that xy > 0. Similarly, for each j G [g], there is an i G [p] so 
that Xij > 0. Thus, each node in B(x) is incident to an edge, so the graph B{x) must be 
spanning. 

Suppose x is a vertex of P. We argue that the graph B{x) cannot contain a cycle by 
contradiction. Suppose B{x) has a cycle. Let A > be the minimum weight X{j among all 
edges in the cycle. Since B(x) is bipartite, the cycle has an even number of edges. Decompose 
the cycle as the disjoint union of two edge sets E + and E_ so that every other edge along the 
cycle is in the set E + and every other edge is in £L . Let V = \ Y^u j)<^E+ e i,j ~ I j)e_B_ e *J • 
(Here, ey is the basis unit vector in the direction of the variable Xij.) The vector y is in 
the kernel of the defining matrix A of the transportation polytope P. Therefore, both x + y 



and x — y belong to P. By Lemma 1.0.10, the point x G P is not a vertex. Contradiction. 

For the converse, suppose x G P and suppose that the graph B(x) is a spanning forest. 
Any non-zero vector y G M pq that can be added to x and stay in the polytope P must be in 
the kernel of the defining matrix. The support of the vector y, thought of in terms of the 
bipartite graph B(y) induces a cycle. Thus, it cannot be that both x + y and x — y belong 



to P. By Lemma |l.0.10 x is a vertex of P. 



Now suppose that the transportation polytope P is generic. Suppose, for a contradiction, 
that B(x) is not a tree. Consider one of the connected components of B(x), say, the subgraph 
induced by the nodes Y C [p] and ZC [q]. Since B(x) is not a tree, at least one of Y or 
Z is a proper subset. Then, J2 i£Y u i = Yjez v ji which contradicts ( |1.11[ ). Therefore, the 
graph B(x) is a tree if x is a vertex of a generic classical transportation polytope. □ 
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As an immediate corollary, we get: 

Corollary 1.4.10. Let P be a generic p x q classical transportation polytope. Let x 
be a point in the transportation polytope P. Then x is a vertex of P if and only if 
| supp(x)| = p + q — 1. 

We now define a notion equivalent to genericity that will be needed in the next section: 

Definition 1.4.11. A transportation polytope is non-degenerate if is simple and it is of 
maximal possible dimension. 

The condition on maximality of dimension will be important in the next section where 
we introduce multi-way transportation polytopes. From our corollary, we can prove that 
the notions of genericity and non-degeneracy are equivalent for classical transportation 
polytopes: 

Lemma 1.4.12. Let P be a non-empty p x q classical transportation polytope. Then P is 
generic if and only if P is non-degenerate. 



Proof. The dimension is always maximal by Corollary 1.4.1 The statement follows 



from the equivalence in Corollary |1.4. 10 □ 



The support graph gives the following characterization of edges of classical transportation 
polytopes. (See Lemma 4.1 in Chapter 6 of |288| .) 

Proposition 1.4.13. Let x and x' be distinct vertices of a classical transportation polytope 
P. Then the vertices x and x' are adjacent if and only if the graph B(x) U B(x') contains a 
unique cycle. 

This can be seen since the bases corresponding to the vertices x and x' differ in the 
addition and the removal of one element (see |209j or |247j ) . For an elementary proof: 

Proof. Let E consist of all edges in the union of B{x) and B(x'), and define F = E 
to be the complement of E in \p] x [q]. Clearly, the polytope P n {x G M pxq \ Xij = 
for all € F} is one-dimensional. □ 



We now introduce the Birkhoff polytope, introduced by Birkhoff in [36J . We will discuss 
new properties of Birkhoff polytopes in Section 3.3 
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Definition 1.4.14. The pth Birkhoff polytope, denoted by B p , is the p x p classical 
transportation polytope with margins u = v = (1, 1, . . . , 1) T . 

The Birkhoff polytope is also called the assignment polytope or the polytope of doubly 
stochastic matrices. It is the perfect matching polytope of the complete bipartite graph K PtP . 
The following theorem states that the vertices of the Birkhoff polytope are the permutation 
matrices, and therefore that any doubly stochastic matrix may be represented as a convex 
combination of permutation matrices. 

Theorem 1.4.15 (Birkhoff- von Neumann Theorem). The p\ vertices of the pth Birkhoff 
polytope Bp are the permutation matrices of size p x p. 

This theorem was stated in the 1946 paper [36J by Birkhoff and proved independently 
by von Neumann in 1953 (see [284J). Equivalent results were shown earlier in the 1894 
thesis [264j of Steinitz, and the theorem also follows from the 1916 papers [186J and [187 
by Konig. (For a more complete discussion on the history of the Birkhoff-von Neumann 
Theorem, see the preface to [207].) The vertices of a Birkhoff polytope are examples of 
semi-magic squares: 

Definition 1.4.16. Let p,a G Z>o- A p x p semi-magic square of order a is a p x p table 
x = (xjj)j g [pi jgjp] of numbers in Z>o such that 

v p 
^Xij = a (i G [p]) and ^Xij = a (j G [p]). 
j=i i=l 

That is to say, a semi-magic square is an integral lattice point in a transportation polytope 
where every row and column sum is the same, namely a. The number a is called the magic 
number. 




Figure 1.22. A 4 x 4 magic square of u = 33. 
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A p x p magic square of order a is a semi-magic square that also satisfies the two 
additional equations 

p p 
J, = <? and x itP - i+ i = a. 
3=1 i=l 

In other words, magic squares have the additional condition that the two diagonals also sum 



to the magic number a. See Figure 1.22 for an example. 



Definition 1.4.17 (Generalized Birkhoff polytopes). We can generalize the definition of 
the Birkhoff polytope to rectangular arrays. The generalized Birkhoff p x q polytope is the 
p x q classical transportation polytope with u\ = ■ ■ ■ = u p = q and v\ = ■ ■ ■ = v q = p. (This 
polytope is also known as the central transportation polytope of size p x q.) 

1.4.2. Multi-way transportation polytopes 

Classical transportation polytopes are also called 2-way transportation polytopes since the 
coordinates Xij have two indices. We can consider generalizations of 2-way transportation 
polytopes by considering coordinates in three or more indices (e.g., sc^fe or etc.). We 

introduce two natural generalizations of 2-way transportation polytopes to 3- way transporta- 
tion polytopes, whose feasible points are p x q x s tables of non-negative reals satisfying 
certain sum conditions: 

• First, consider the 3- way transportation polytope of size p x q x s defined by 1- 
marginals: Let u = (ui, . . . , u p ) G W, v = (y±, . . . , y q ) G M 9 , and w = (wi, . . . , w s ) G 
M s be three vectors. Let P be the polyhedron defined by the following p + q + s 
equations in the pqs variables Xij s k G M>o (« G \p],j G [q],k G [s]): 

(1.12) ^2 x i,j,k = Ui,Vi ^2xi,j,k = Vj,Vj ^2 x i,j,k = w k ,\/k. 

j,k i,k i,j 

Observe that a necessary and sufficient condition for the polytope P to be non-empty 
is that 

pqs 
i=l j=l k=l 

and consequently the polytope P is defined by only p+q+s— 2 independent equations. 
(In the book [288J, 3-way transportation polytopes defined by 1-marginals are known 
as 3-way axial transportation polytopes.) 
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• Similarly, a 3-way transportation polytope of size p x q x s can be denned by 
specifying three real-valued matrices U, V, and W (respectively) of sizes q x s, 
p x s, and p x q (respectively). These three matrices specify the line-sums resulting 
from fixing two of the indices of entries and adding over the remaining index. That 
is to say, the polyhedron P is defined by the following pq + ps + qs equations, called 
the 2-marginals, in the pqs variables scjj fe € K>o satisfying: 

y~] x i,j,k = Uj,k,Vj,k y^Xi,j,k = Vj,k,Vhk y^ x i,j,k = Wj t j,Vi,j. 
i j k 

One can see that in fact only pq + ps + qs — p — q — s + 1 of the defining equations 
are linearly independent for feasible systems. (In [288], the 3-way transportation 
polytopes defined by 2-marginals are called 3-way planar transportation polytopes.) 

Remark 1.4.18. Of course, one can easily generalize these concepts to u-way tables for any 
integer u and fi-marginals for any 1 < \i < u. In [288], u-way transportation polytopes 
defined by 1-marginals are called axial transportation polytopes while u-way transportation 
polytopes defined by (oj — 1) -marginals are called planar transportation polytopes. Clearly, 
u-way transportation polytopes are partition polyhedra. 

Observe that the 3-way transportation polytopes of size p x q x s defined by 1-marginals 
generalize the classical transportation polytope of size p x q, when s = 1 and w\ = Ui = 
^2 v j- A less trivial rewriting of the classical p x 2 transportation polytope as a 3-way 



transportation polytope of size p x 2 x 2 defined by 2-marginals is given in Theorem 3.0.2 
Some of the proofs will require our 3-way transportation polytopes to be non-degenerate. 
We use the same definition as we did for classical transportation polytopes: 

Definition 1.4.19. A multi-way transportation polytope is non-degenerate if is simple and 
it is of maximal possible dimension. 

The maximum possible dimension for p x q x s transportation polytopes defined by 1- 
marginals is pqs —p — q — s + 2. The maximum possible dimension for pxqx s transportation 
polytopes defined by 2-marginals is (p — l)(q — l)(s — 1). Graphs of non-degenerate 
transportation polytopes are of particular interest because they have the largest possible 
number of vertices and largest possible diameter among the graphs of all transportation 
polytopes of given type and parameters (e.g., pqq, and s). Indeed, if P is a degenerate 
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transportation polytope, by carefully perturbing the marginals that define P we can get a 
non-degenerate polytope P'. (A careful explanation of how to do the perturbation in the 
case of classical transportation polytopes is presented as Lemma 4.6 in Chapter 6 of [288 
on page 281.) The perturbed marginals are obtained by taking a feasible point x in P, 
perturbing the entries in the table and using the recomputed sums as the new marginals for 
P'. The graph of P can be obtained from that of P' by contracting certain edges, which 
cannot increase either the diameter nor the number of vertices. 

Definition 1.4.20. In Definition 1.4.17\ we presented a generalization of Birkhoff polytopes 
to p x q rectangular arrays. The Birkhoff polytope has the following generalizations in the 
3-way setting: 



(1) The generalized Birkhoff 3-way axial polytope is the axial 3-way transportation 
px qx s polytope whose 1-marginals are given by the vectors u = (qs, . . . , qs) G MP, 
v = (ps, . . . ,ps) G M q , and w = (pq, . . . ,pq) G M s . 

(2) The generalized Birkhoff 3-way planar polytope is the planar 3-way transportation 
p x q x s polytope whose 2-marginals are given by the q x s matrix Uj k = P, the 
p x s matrix V{k = q, and the p x q matrix W%j = s. 



1.4.3. The transportation problem and related problems 

The transportation problem was formulated by Hitchcock in 1941 (see |157| ). A similar 
problem was studied by Monge in 1781 (see [218j and pages 227-228 in the book [31] by 
Berge). Kantorovich studied Monge 's formulation of the problem in 1942 (see |171j ). A 
basic summary of the problem and some algorithms is found in [240J. In its original form, 
the standard transportation problem is defined in the following way: given a p x q classical 
transportation polytope P defined by the marginals u G MP and v €M q and a p x q matrix 
£ = (£i,j)ie\p]je[q] with real entries, solve the linear program 

Minimize £i,j x i,j subject to x = (xij) G P. 

*S[p] j€[q] 

The transportation problem is a special case of the minimum cost flow problem (see, e.g., [6] 
or [15]). 
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We describe some variants of the transportation problem. In the usual transportation 
problem, the only variables are Xij, which is to say that the model only allows transportation 
of goods between sources and demands. In many applications, this assumption is too strong. 
One may desire more flexibility, such as allowing goods to be transported between supplies 
and between demands. There might also be points through which goods can be shipped from 
a supply to a demand. These more general problems are called transshipment problems. Any 
transshipment problem can be easily converted into an equivalent transportation problem 
(see [6]). 

Another variant of the transportation problem is the assignment problem. An assignment 
problem is an integer programming problem on the Birkhoff polytope. (Since the constraint 
matrix for classical transportation polytopes is totally unimodular, i.e., every non-singular 
submatrix has determinant ±1, the integer programming problem is equivalent to the linear 
programming relaxation (see, e.g., [138] ). but this is not true for general integer programs.) 
It is the integer version of the transportation problem. To be clear, the problem is: given a 
p x p classical transportation polytope P defined by the marginals u = (1, . . . , 1) G W and 
v = (1, . . . , 1) G W and a p x p matrix £ = (S,i,j)ie[p],je[p] with real entries, solve the integer 
program 

Minimize £i,j x i,j subject to x = (xij) G Pf]7P xp . 

i£\p] j£\p] 

This problem is also known as the linear assignment problem. See the recent book [60] by 
Burkard et al. on the assignment problem. Since u = v = (1, 1, . . . , 1) G M p , it follows that 
P n 7L pxp = P n {0, l} pxp , and therefore feasible solutions of this integer program are 0-1 



matrices. In fact, by Theorem 1.4.15, they are permutation matrices. The problem is called 
the assignment problem since it models the situation where p people need to be assigned to 
p jobs. In the model, each person is assigned exactly one job, and each job is performed 
by exactly one person. The ith person is assigned to the jth job exactly when Xij = 1. 



Otherwise, X{j = 0. By Lemma 1.4.9, the problem can be converted into one of finding a 
maximum weight matching in a weighted bipartite graph. 
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1.4.4. A survey of transportation polytopes and the transportation prob- 
lem 

Chapter [3] presents our new results on transportation polytopes. In this section, we survey the 
state of the art in transportation polytopes and the transportation problem. The survey [281J 
by Vlach, the 1984 monograph by Yemelichev, Kovalev, and Kratsov (see |288j ). and the 
paper |184j by Klee and Witzgall summarized the status of transportation polytopes up to 
the 1980s. See also the recent survey of De Loera and Onn (see [99] ) on recent advances in 
transportation polytopes. 

Applications of transportation polytopes to statistics 

Transportation polytopes appear in statistics, where they are called contingency tables. 
See the survey by Diaconis and Gangolli (see [112J). A major practical application for 
transportation polytopes is in the study of statistical data security. The table entry security 
problem has been studied by Chowdhury et al. (see |70j ). Duncan (see |115j and |116j ) 
and Irving (sec [164J), among others. Heuristic algorithms that study table entry security 
have been studied by Fienberg (see |125j ) and Buzzigoli and Giusti (see |61|). In [77J, Cox 
shows that these heuristic algorithms are not exact and presents an alternative algorithm. 

Table entry security problems are related to the study of magic squares and lattice points 
in transportation polytopes. In |78j, Cox characterizes conditions needed on multi-way 
tables to guarantee that continuous bounds on integer-valued entries exist and are integral. 
The statistical data security problem is also related to the study of bounds on a particular 
entry (see, e.g., [T13J by Dobra and Fienberg). 

In [102) . De Loera and Onn give a complete description of the complexity of existence, 
counting, and entry-security in multi-way tables. Mehta and Patel (see |214| ) describe an 
exact test of significance for the independence of rows and columns of p x q contingency 
tables. 
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Transportation polytopes and optimization 

Before discussing the transportation problem, we discuss other questions in optimization 
theory related to the geometry and combinatorics of transportation polytopes. In [232J, 
Pak analyzes the simplex method for the pth. Birkhoff polytope B p . Pak defines a certain 
linear functional £ on M pxp so that the pth Birkhoff polytope B p has a ^-monotone path of 
length C ■ n\, for a universal constant C > 0. However, Pak (see |232j ) also shows that the 
expected average running time of the simplex method on the Birkhoff polytope with respect 
to £ is in 0(plogp). 

Karp (see [TT3J ) proved that solving assignment programs with an arbitrary cost function 
on generalized 3-way p x p x p planar Birkhoff polytopes is NP-hard. (For more on NP- 
hardness and other complexity classes, see |73|.) 

De Loera, Hemmecke, Onn, and Weismantel (see [94 ) prove there is a polynomial time 
algorithm that, given s and fixing p and q, solves integer programming problems of 3-way 
transportation polytopes of size p x q x s defined by 2-marginals, over any integer objective. 

In |93j De Loera, Hemmecke, Onn, Rothblum, and Weismantel present a polynomial 
oracle-time algorithm to solve convex integer maximization over 3-way planar transportation 
polytopes, if two of the margin sizes remain fixed. More recently (in (156j ) Hemmecke, Onn, 
and Weismantel prove a similar result for convex integer minimization. 

In [83] . Cuturi analyzes the Monge-Kantorovich distance between the vectors u and v in 
terms of the transportation polytope with margins u and v. 

Solving the transportation and assignment problems 

Many researchers have studied algorithms to solve the transportation problem (see, e.g., [188J, 
[236J, [256) . [257J, and 277\). See the recent book [60] for more on the assignment problem. 
In 1955, Kuhn (see |194j ) introduced the Hungarian method, a combinatorial algorithm for 
solving the assignment problem, named in honor of Hungarian mathematicians Konig and 
Egervary. Later, Kuhn (see [1951 ) developed a geometric model for variants of the algorithm. 
In 1957, Munkres (see |223j ) proved that the algorithm is strongly polynomial. Edmonds 
proved (see [TT8]) Kuhn's original algorithm was in 0(n 4 ). Edmonds and Karp (see |120j ) 
and independently Tomizawa (see [278]) modified the algorithm to 0(n 3 ). (In [19J, Balinski 
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and Gomory present an algorithm which is dual to the Hungarian algorithm.) An extension 
to the Munkres Hungarian algorithm is presented in |47j . Variants of the Hungarian 
algorithm are studied in |179j . In [139J, Gill and Linusson study a multi-way analogue 
of the assignment polytope. In [238], Queyranne and Spieksma describe formulations 
and applications for multi-index transportation problems. In [131], Ford and Fulkerson 
describe a simplified computing procedure for the transportation problem based on Kuhn's 
combinatorial algorithm for the assignment problem and a labeling process for solving 
maximal flow problems in networks. See the books [119] and [207] for more details on 
the matching problem. Hoffman (see |158j) summarizes significant advances in research 
on the transportation problem. In [258], Srinivasan and Thompson present a computer 
code for the transportation problem that is more efficient than the primal-dual method. 
In [T51J, Harris describes a variant of the code by Srinivasan and Thompson which greatly 
reduces the computation time for long and narrow transportation problems. A summary of 
computational results on various algorithms for the assignment problem is given in |127j . 
A similar study for transportation problems is given in [200]. Miiller-Merbach (see [222] 
and |22X]) describes an improved starting algorithm for the Ford- Fulkerson method. 

In [321 . Bertsekas and Castanon solve the transportation problem by converting it into 
an assignment problem and using a generalization of Bertsekas' auction algorithm. Each 
iteration of the auction algorithm of Bertsekas never decreases the linear functional being 
maximized. Bertsekas and Castanon (see |32j ) show that this modified version of the auction 
algorithm is very efficient for certain types of transportation problems. 

Since the transportation problem is a linear programming problem, one can solve the 
problem using the simplex method. Dantzig (see [87]) studied the behavior of the simplex 
method on transportation problems. In [11], Arsham and Kahn present a simplex-type 
alternative to the stepping stone method of Charnes (see |66J) to solve the transportation 
problem. In [234], Papamanthou et al. present an experimental computational study to 
compare the classical primal simplex algorithm and the exterior point algorithms for the 
transportation problem. In [165], the authors describe what the so-called dual matrix 
approach to solving the transportation problem. 
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In [225], Nunkaew and Phruksaphanrat solve a particular two-objective version of the 
transportation problem using lexicographic goal programming. For more on multi-objective 
programming, see the book |121j by Ehrgott. 

Solving special cases of the transportation problem 

Special cases of the transportation problem can sometimes be solved faster than the general 
transportation problem. For example, in [22j . Barnes and Hoffman describe two special 
families of transportation problems that can be solved using a greedy algorithm. Glicksman 
et al. (see |140| ) considers the special case for which the number of demands is many times 
greater than the number of supplies. The transportation problem with exclusionary side 
constraints has been studied by Goossens and Spieksma (sec [143J) and Sun (see |270j ). 

Signature algorithms solve certain classes of transportation problems in a number of 
steps bounded by the diameter of the dual polyhedron. Using signature algorithms, Balinski 
and Rispoli (see [20J) prove the Monotone Hirsch Conjecture holds for a certain class of 
classical transportation polytopes called signature polytopes. 

Solving generalizations and variants of the transportation problem 

Many generalizations and variants of the standard transportation problem have been studied. 
Appa (see [9]) surveys the solutions to 81 practical variants of the transportation problem by 
considering negative costs, and variants obtained by inequality constraints on the margins. 
In this section, we survey many of the generalizations and variants of the transportation 
problem. 

In |21j . Bammi formulates a generalized- indices transportation problem and presents an 
algorithm for its solution. In [14J, Baiou and Balinski study the stable allocation problem, 
which generalizes 0-1 stable matching problems to real- valued quantities. 

In [126], Finkelshtein modifies the standard p x q transportation problem by adding 
a decision variable y > 0, a p x q matrix Z, a constant z, and the additional constraint 
SiefpHeM ^i,j x i,j ~ V = z - Finkelshtein (see |126j ) presents an iterative method for its 
solution. In the note [250 ], Shkurba presents an exact method to solve this modified 
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transportation problem. Lourie (see [206j ) analyzes the stepping-stone method for solving a 
generalized transportation problem. 

In general, the quadratic assignment problem is NP-hard (see [243]). In [229J, Onn and 
Rothblum show that certain instances of the positive definite quadratic assignment problem 
are tractable by converting these problems into convex combinatorial optimization problems. 

The transportation paradox is the phenomenon in the transportation problem that the 
largest total transportation cost may not occur at the highest total quantities shipped. The 
paradox was discovered by Charnes and Klingman (see |67j ). and independently by Szwarc 
(see |271j ). A sufficient condition for the paradox to occur was studied by Adlakha and 
Kowalski (see [1]). In |267j . Stor0y extends the work of Deineko, Klinz, and Woeginger 
(see |107] ). and which present instances of the transportation problem that are immune to 
the transportation paradox. Liu (see |205| ) investigates the paradox when the demand and 
supply quantities are varying. Arsham (see |10j ) studies the paradox via post-optimality 
analysis methods. 

In |203| . Lin introduces a genetic algorithm to solve transportation problem with fuzzy 
objective functions. Lin and Tsai (see [204J) investigate solving the transportation problem 
with fuzzy demands and fuzzy supplies using a two-stage genetic algorithm. Li, Ida, and Gen 
(sec [202J) present an improved genetic algorithm for solving the fuzzy multiobjective solid 
transportation problem. When the transportation problem is associated with additional 
fixed cost for establishing the facilities or fulfilling the demand of customers, then it is 
called fixed charge transportation problem. Jo, Li, and Gen (see [166J) apply the spanning 
tree-based genetic algorithm approach for solving the non-linear fixed charge transportation 
problem. The fixed-charge problem is a non-linear programming problem of practical interest 
in business and industry. One of its variations, the fixed-charge transportation problem, 
where fixed cost is incurred for every route that is used in the solution, along with the 
variable cost that is proportional to the amount shipped is studied by Kowalski and Lev 
(see [189]). 

In |82j . Currin studies the transportation problem with inadmissible routes. In [196J, 
Kuno and Utsunomiya address a method for solving two classes of production-transportation 
problems with concave production cost. In [163J, Imam et al. describe a method of solving 
the transportation problem based on the object-oriented programming model. In [153J, 
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Hartwick generalizes the Hitchcock-Koopmans analysis to take account of variable supplies 
and demand for a product at diverse geographically-separated locations. 

Combinatorics of transportation polytopes 

The geometric combinatorial structure of transportation polytopes is a very active area of 
study. In [184], Klee and Witzgall examine the combinatorial structure (and in particular, 
the number of vertices) of transportation polytopes. Klee and Witzgall conjectured (see [184] ) 
and Bolker proved (see |43j ) that when p and q are relatively prime, the maximum possible 
number of vertices among p x q classical transportation polytopes is achieved by the 
generalized p x q Birkhoff polytope. In [233J, Pak presents an efficient algorithm for 
computing the /-vector of the generalized Birkhoff polytope of size p x q when q = p + 1. 
Hartfiel (see |152j ) and Dahl (see |86j ) describe the supports of certain feasible points in 
classical transportation polytopes. 

The diameters of classical transportation polytopes have been studied extensively. Balin- 
ski and Rispoli (sec [20J) explain why Kravtsov's supposed proof (see [191] ) of the Hirsch 
Conjecture for transportation polytopes is incomplete. In |79j . Cryan et al. analyze a natural 
random walk on the graph of the transportation polytopes. (In [232J, Pak proves that 
nearest neighbor random walk does not mix fast on all 0-1 polytopes, but that it does mix 
fast on Birkhoff polytopes.) In [18J, Balinski proves that the Hirsch Conjecture holds and is 
tight for dual transportation polyhedra. In |16j . Balinski and Russakoff compute /-vectors 
of dual transportation polyhedra by analyzing partitions of p + q — 1 . McKeown and Rubin 
(see [212] ) and Oviedo (see [231J) analyze the adjacencies of vertices in transportation 
polytopes. 

Dyer and Frieze's (see [117] ) polynomial diameter bound for totally unimodular polytopes 
based on random walks applies to classical transportation polytopes. Yemelichev, Kovalev, 
and Kravtsov (see Theorem 4.6 in Chapter 6 of [288J) and Stougie (see |268| ) present 
improved polynomial bounds. This was improved to a quadratic bound by van den Heuvel 
and Stougie in [279J. The first linear upper bound on the diameter classical transportation 
polytopes was proved in 2006 by Brightwell, van den Heuvel, and Stougie (see |51| ). although 
this was recently improved by Hurkens (see p_62j ) . 
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In [T8J , Balinski proved that the Hirsch Conjecture holds for the bounded polytopes 
resulting from the intersection of a dual transportation polyhedron with a certain hyperplane. 



In joint work with De Loera, Onn, and Santos (see [97J and Section 3.6 in this thesis), 
we prove a quadratic bound on the diameter of 3- way axial transportation polytopes (see 



Theorem 3.0.3). 

In |281j . Vlach surveys conditions for the non-emptiness of the 3-way planar transporta- 
tion polytope. Schell (see [246] ). Haley (see [148] and [US]), Moravek and Vlach (see [2l9] 
and |220j), and Smith (see |252j, [253J, and |254j) prove necessary but not sufficient 
conditions on the margins for a 3-way transportation polytope defined by 2-marginals to 
be non-empty. The note [123] presents criterion for a polytope to belong to the class of 
multi-index planar transportation polytopes with a maximum number of vertices. Kravtsov 
et al. (see [190] . [T92J , and [193J) investigates combinatorial properties of multi-way 
transportation polytopes. 

The 3-way transportation polytopes are very interesting because of the following univer- 
sality theorem of De Loera and Onn in [101J. 

Theorem 1.4.21. Let P be a rational convex polytope. Then, there is a 3-way planar 
transportation polytope P isomorphic to P. Moreover, there is a 3-way axial transportation 
polytope P which has a face F isomorphic to P. 

Isomorphic here means that, in particular, the polytope P or its face F have the same 



face poset as P. (The polytope P is presented in the form shown in Lemma 1.3.3 ) The 
result in [101] also says that, given the polytope P, there is a polynomial time algorithm to 
construct P and the isomorphism mentioned. (See also [104| and [103 .) 

In [59] . Bulut and Bulut study the axial 4-way transportation polytopes and study 
algebraic characterizations of them. In [58] , the authors describe a network flow problem as 
a planar transportation polytope problem and solve the problem in terms of eigenvectors of 
certain matrices. 

Bolker (see [44] ) defines an analogue of the support graph B(x) for w-way transportation 
polytopes and studies its homological properties. (See [154J for an in-depth treatment of 
homology.) Lenz (see [2011 ) proves a degree bound on a certain reduced Grobner basis for 
classical transportation polytopes. 
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In |95j . De Loera et al. report on, among other things, the computational results of 
counting lattice points in multi-index transportation polytopes using the software LattE 
(see [92j). In [147J, Haase and Paffenholz prove that the toric ideals of every 3x3 
transportation polytope T is quadratically generated, if T is not a multiple of the third 
Birkhoff polytope B%. 

Birkhoff polytopes 

The computation of volumes and triangulations of the Birkhoff polytope is related to the 
problem of generating a random doubly stochastic matrix (see |65j ). Here, we survey past 
results on the triangulations, volumes, and lattice point enumeration of Birkhoff polytopes. 
In Section [3 .3[ we present our new results on the existence of non-regular triangulations of 
Birkhoff polytopes. 

The explicit volume of the pth Birkhoff polytope B p is known (see |235j ) up to p = 10. 
In [232J, Pak proves that the volume of the Birkhoff polytope is equal to the volume of a 
different polytope. Canfield and McKay (see |62j) present an asymptotic formula for the 
volume of the pth Birkhoff polytope B p . 

In [98], De Loera, Liu, and Yoshida present a generating function for the number of 
semi- magic squares. Using this generating function, De Loera et al. (see [98J) present 
formulas for the coefficients of the Ehrhart polynomial of the pth Birkhoff polytope B p , and 
a combinatorial formula for the volume of the pth Birkhoff polytope B v for all p. Barvinok 
(sec |26j) presents an asymptotic upper and lower bounds for the volumes of p x q classical 
transportation polytopes and the number of p x q semi-magic rectangles. In |64j, Carlitz 
describes lattice points of dilations of the Birkhoff polytope using exponential generating 
functions. 

Counting magic squares and lattice points in (dilations of) Birkhoff polytopes is useful for 
computing their volumes. This problem has been studied by Ahmed (see [4] and [5]), Beck 
and Pixton (sec |29j), Beck, Cohen, Cuomo, and Gribelyuk (see |28j), Chan and Robbins 
(sec |65j), Diaconis and Gamburd (see [lllj), Halleck (sec [150J), Hemmecke (see [155J), 
and Stanley (see [2591 and |260j), among others. 
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In |54j , Brualdi and Gibson use the permanent function to determine geometric properties 
of the Birkhoff polytope. In [55] , Brualdi and Gibson study the graph of the Birkhoff 
polytope. In |56j, Brualdi and Gibson investigate the afhne and combinatorial properties of 
the Birkhoff polytope. In [52J, Brualdi and Gibson investigate the extreme points, faces and 
their dimensions of the convex polytope of doubly stochastic matrices which are invariant 
under a fixed row and column permutation. In [124 , Escolano et al. establish a link between 
Birkhoff polytopes and heat kernels on graphs. 

Additional results are known for subpolytopes of the Birkhoff polytope. One can 
consider permutation polytopes, obtained as the convex hull of some vertices of a Birkhoff 
polytope. (Note that this notion of permutation polytope is distinct from the permutation 
polytopes of Billera and Sarangarajan in |35j .) In |227j . Onn analyzes the geometry, 
complexity and combinatorics of permutation polytopes. In [27J, Baumeister et al. study 
the faces and combinatorial types that appear in small permutation polytopes. Brualdi 
(see [53 ) investigates the faces of the convex polytope of doubly stochastic matrices which 
are invariant under a fixed row and column permutation. The pth tridiagonal Birkhoff 
polytope is the convex hull of the vertices of the Birkhoff polytope whose support entries are 
in £ [p] x [p] | \i — j\ < 1}. In [84J, da Fonseca et al. count the number of vertices 

of tridiagonal Birkhoff polytopes. In |76j, Costa et al. present a formula that counts the 
number of faces tridiagonal Birkhoff polytopes. 

Costa et al. (see [74J) define a pth acyclic Birkhoff polytope to be any polytope that is 
the convex hull of the set of matrices whose support corresponds to (some subset of) the 
edges (including loops) of a fixed tree graph. In |75j . Costa et al. count the faces of acyclic 
Birkhoff polytopes. In |74j . Costa et al. prove an upper bound on the diameter of acyclic 
Birkhoff polytopes, which generalizes the diameter result of Dahl in [85 . 

Let the pth even Birkhoff polytope be the convex hull of the \p\ permutation matrices 
corresponding to even permutations. In [81j, Cunningham and Wang confirm a conjecture 
of Brualdi and Liu (see |57] ) that the pth even Birkhoff polytope cannot be described as the 
solution set of polynomially many linear inequalities. In [161] , Hood and Perkinson describe 
some of the facets of the even Birkhoff polytope and prove the conjecture of Brualdi and Liu 
(sec [57 ) that the number of facets of the pth even Birkhoff polytope is not polynomial in p. 
In |282j . von Below shows that the condition of Mirsky given in [217J is not sufficient for 
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determining membership of a point in an even Birkhoff polytope. Cunningham and Wang 
(sec [81 ) also investigate the membership problem for the even Birkhoff polytope. In [283 , 
von Below and Renier describe even and odd diagonals in even Birkhoff polytopes. 
In [69 i, Cho and Nam introduce a signed analogue of the Birkhoff polytope. 

1.5. Enumeration of partition polytopes with fixed constraint 
matrix 

In this section, we discuss the necessary mathematical background to justify the proof of 
the following theorem: 

Theorem 1.5.1. Given a fixed r x c real-valued matrix A, there is a finite algorithm that 
outputs a list of simple polytopes {Pi, . . . , P z } so that any simple polytope of the form 

P = {x G R c | Ax = b, x > 0} 

is combinatorially equivalent to one of the Pi 's. 

We will describe how to implement this procedure. To make everything precise, we will 
illustrate everything using a running example. Our running example for the matrix A will 
be the constraint matrix A = A2x3 of 2 x 3 classical transportation polytopes. In particular, 
using this methodology, we computed a complete catalogue of non-degenerate transportation 
polytopes with small margin sizes: 

Theorem 1.5.2. The following tables (see Appendix [A|) give a complete catalogue of 
transportation polytopes of small sizes: 



The only possible numbers fo of vertices of non-degenerate 2 x 3, 2 x 4, 2 x 5, 3 x 3, 



and 3x4 classical transportation polytopes are those given in Tables A.l\ A.2, A. 3 
A.4\ and A. 5, respectively. 



The only possible numbers fo of vertices of non-degenerate 2x2x2 and 2x2x3 
axial transportation polytopes are those given in Ta bles [AT6| and [A . 7\ respectively. 

Every non-degenerate 2x2x4 axiai transportation polytope has between 32 
and 504 vertices. Every non-degenerate 2x3x3 axial transportation polytopes has 
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between 81 and 1056 vertices. The number of vertices of non-degenerate 3x3x3 
axial transportation polytopes is at least 729. 

The only possible numbers fo of vertices of non-degenerate 2x2x2 and 2x3x3 



planar transportation polytopes are those given in Tables A. 8 and A.9, respectively. 

Every non-degenerate 2x3x4 planar transportation polytope has between 7 
and 480 vertices. 

The catalogue (see Appendix [A]) was obtained through an exhaustive enumeration of 
combinatorial types of transportation polytopes whose foundation is the theory of secondary 
polytopes and parametric linear programming. In some cases when the full enumeration 
was impossible we can at least obtain lower and upper bounds for the number of vertices 
that these polytopes can have. 



We begin by recalling Lemma 1.0.12 which said that x G M c is a vertex of the partition 
polyhedron P = {x G M c | Ax = b, x > 0} if and only if x is a basic feasible solution 
corresponding to the basis A of A. Moreover, if the polyhedron P is assumed to be simple 
then the basic feasible solution must be strictly positive on the entries corresponding to 
the basis A. Geometrically, a basis A C A produces a vertex of a simple polyhedron P if 
and only if b lies in the interior of the cone generated by A. In conclusion, the vertices 
of a simple partition polyhedron P are in bijection with the bases that define cones in W 
containing the vector b G W within their interior. 

We discuss the enumeration of partition polyhedra (as all transportation polytopes are) 
by discussing what happens to the combinatorics of Pf, = {x G M c \ Ax = b, x > 0} as the 
vector b G M r changes while the r x c matrix A remains fixed. (For the case of transportation 
polytopes, the matrix A describes the type and size of the transportation polytope, and b is 
the vector given by the margins.) This study, for general matrices, is known as parametric 
linear programming (see Chapter 1 Section 2 and Chapter 9 Section 5 of [105 J). Since the 
vector b will vary, we use the notation Pf, to describe the partition polyhedron given by 
b G M. r for our fixed matrix A. 

What happens when we let the vector b vary? If the polyhedron Pf, is simple and the 
change in b is small, the facets of Pf, move but the combinatorial type of Pi, does not change. 
Only when a basic solution changes from being feasible to not feasible, or vice versa, the 
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combinatorics of Pb (that is, the face lattice and, in particular, the graph of Pb) can change 
(see 00] and [TBT] K 

Put differently: Let Y>a denote the set of all cones generated by bases of A. Let dT,A 
denote the union of the boundaries of all elements of Y>a- The connected components of 
cone(A) \ 8Y>a are open convex polyhedral cones called the chambers of A. (To be precise, 
the connected components of cone(A) \ dY<A are the relative interiors of polyhedral cones.) 
We call the chamber associated to a given vector b G M r the intersection of the interiors of 
simple cones that contain the vector b in their interior. Every sufficiently generic vector 
b G cone(A) is in a chamber (as opposed to lying on OT^a)- Two vectors b and b' in the same 
chamber define simple polytopes Pb and Py that are equivalent up to combinatorial type. 
The collection of all the chambers is the chamber complex (or chamber system) associated 
with A. Putting all this together we conclude: 

Proposition 1.5.3. Fix a matrix A of size r x c of full row rank r. To represent every 
possible combinatorial type of simple polytope of the form Pb = {x G M c | Ax = b, x > 0}, 
for a fixed A and over varying vector b G W, it is enough to choose one b G M. r from each 
chamber of the chamber complex of A. 



We illustrate Proposition 1.5.3 through an example. The discussion here gives a full 



description of the software transportgen (see [176J). 

Example 1.5.4. Consider the 5x6 defining matrix A of all classical 2x3 transportation 
polytopes. That is: 

1110 
1 1 1 
A= 10 10 
10 10 
1 1 

Up to permutation of coordinates, the system {x G M 6 | Ax = 6, x > 0} dehnes all classical 
2x3 transportation polytopes with marginals given by the vector b G M 5 . 
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.4 



The matrix A does not have full row rank, so instead we remove the last row. Let A2x3 
be the 4x6 matrix 

1110 
1 1 1 
10 10 
10 10 

The columns of the matrix vl2x3 span a four-dimensional cone in IR 5 . It will be relevant 
later that if we slice this cone by an afhne hyperplane (such as Y2i Xi = 1) we obtain the 



2x3 



three-dimensional triangular prism shown in Figure 1.23 but embedded in M 4 . 

The chamber complex can be obtained by slicing the prism with the six planes containing 
a vertex of the prism and the edge "opposite" to it. The resulting chamber complex is 
hard to visualize or draw, even in this small case, but we will see later how to recover the 
structure of the chamber complex for this example using Gale transforms. In particular, as 
we will see, this decomposes the triangular prism into 18 chambers. 

It is very easy to "sample" inside the chamber complex and find chambers of different 
numbers of bases, i.e., transportation polytopes with different numbers of vertices. One 
can simply throw random positive values to the cell entries of a 3-way p x q x s table and 
then compute the 1-marginals or 2-marginals associated to it. But with this method it is 
not obvious how to guarantee that one has obtained all the possible chambers. We want to 
ensure that we have visited every chamber. To do this, we use the following approach based 
on Gale transforms and regular triangulations. (For a detailed treatment on triangulations, 
see [105] .) 

Let A be a vector configuration of c vectors in M r . We usually think of the column 
vectors of a matrix A of size r x c. A vector configuration B of c vectors in W is called a 
Gale transform of the vector configuration A if the row space of the matrix with columns 
given by B is the orthogonal complement in M c of the row space of the matrix with columns 
given by A. (Here, we assume that the matrix A is of full row rank r so that g = c — r.) Gale 
transforms are essential tools in the study of convex polytopes because the combinatorial 
properties of A and B are intimately related (see Chapter 6 in [290] for details). Proofs 
of the following statements can be found in [34J and [137J. (See also Chapters 4 and 5 
of [105], [199] . and [273] .) 
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Lemma 1.5.5. Let A be a vector configuration and A be a Gale transform of A. 

(1) The chambers of A are in bijection with the regular triangulations of the Gale 
transform A of A: From a chamber in A, one can recover a regular triangulation 
of A via complementation, namely for a basis A of vectors in A the elements of 
A not belonging to A form a basis for A. The collection of those bases gives a 
triangulation of A. 

(2) There exists a polyhedron, the secondary polyhedron, whose vertices are in bijection 
with the regular triangulations of the Gale transform A. 

(3) The face lattice of the chamber complex of the vector configuration A is anti- 
isomorphic to the face lattice of the secondary polyhedron of the Gale transform 
A of A. The latter is, in turn, isomorphic to the refinement poset of all regular 
subdivisions of A. 

(4) If the cone generated by A is pointed (for example, if all of its entries are non- 
negative as it is the case for transportation polytopes), then its Gale transform A 
is a totally cyclic vector configuration. (In other words, the cone generated by A is 
all ofR 9 .) 

To summarize, this means: 

Corollary 1.5.6. To enumerate all simple partition polyhedra P of the form P = {i£ 
M c | Ax = b, x > 0} with hxed constraint matrix A and vary vector b £ M. r is the same as 
enumerating all the chambers of the chamber complex of A, which is the same as enumerating 
all of the regular triangulations of A. 



Example 1.5.7. We take again the 4x6 matrix ^2x3 of rank four defined in Example 1.5.4 
A Gale transform -02x3 = ^2x3 0^^2x3 consists of the columns of the 2x6 matrix 



E>2x3 = ^.2x3 



1-1 -110 
1 -1-10 1 



In Figure 1.23 we represent the Gale diagram A2x3 and its 18 regular triangulations, 
each one providing a combinatorial type of non-degenerate 2x3 classical transportation 
polytope, although repeated combinatorial types occur. (A representative vector in each 
of the 18 chambers is chosen in the presentation of the 18 non-degenerate 2x3 classical 
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transportation polytopes presented in |175j . ) The chamber adjacency, which corresponds to 
bistellar flips, is indicated by dotted edges. 




Figure 1.23. The 18 regular triangulations of ^2x3, which correspond to 
the 18 chambers, each defining a non-degenerate 2x3 transportation polytope. 

Thus, generating all the combinatorial types of non-degenerate transportation polytopes 
is the same as listing the distinct regular triangulations of the Gale transform of the defining 
matrix A. In the case of transportation polytopes, the matrix A depends only on the margin 
type (1-marginals, 2-marginals, etc.) and the margin sizes (p, q, s, etc.). 

Now, it is well-known that the regular triangulations of a vector configuration can all be 
generated by applying bistellar flips to a seed regular triangulation (see, e.g., [34], |105j . [290J). 
Bistellar flips are combinatorial operations that transform one triangulation into another 
and regularity of triangulations can be determined by checking feasibility of a certain linear 
program. In our case, the linear program is the very one that defines the polytope An 
example of this linear programming feasibility problem is shown in Example 5.2.10 in [105j . 
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Example 1.5.8 (Example 1.5.4 continued). Consider the only triangulation of Ax 3 with six 



cones (the leftmost one in the middle row in Figure 1.23). The necessary and sufficient con- 
ditions in the non-negative vector a = (a\, ■ ■ ■ , a$) in order to produce this triangulation 
are that each en be smaller than the sum of the two adjacent to it. That is, 

Ol < CZ5 + 06, (22 < 04 + a®, 03 < (24 + 05, 
04 < ci2 + 03, a 5 < a\ + a 3 , a§ < a% + 02- 

Thus, these conditions on the marginals characterize the 2x3 transportation polytopes that 
are hexagons. 

This method is implemented in the C++ program transport gen (see |176 |). This 
program calls TOPCOM (see [239j)> a package for triangulations that computes, among other 
things, the list of all regular triangulations of a given point configuration B. (In our case, 
B is the Gale transform A of A) The program calls polymake (see [136J ) for the Gale 
transform. The input to this program is a matrix A of size r x c. The program outputs one 
vector b G M. r per chamber in the chamber complex of A. The result is a list of transportation 
polytopes, one per chamber, given in the polymake file format (see [136J) representing all 
possible combinatorial types of simple transportation polytopes defined by A. 



1.5.1. Lower and upper bounds via integer programming 

Even for seemingly small cases, such as 3 x 3 x 3 transportation polytopes defined by 1- 
marginals, listing all chambers (and thus all combinatorial types of transportation polytopes) 
is practically impossible. In these cases we can follow a different approach to at least 
obtain upper and lower bounds for the number of vertices of transportation polytopes. By 
the discussion above, this is the same as finding bounds for the number of simplices in 
triangulations of the Gale transform A. Here we follow the method proposed in [96], based 
on the universal polytope. This universal polytope, introduced by Billera, Filliman, and 
Sturmfels in [33] , has all triangulations (regular or not) of a given vector configuration B in 
M 9 as vertices, and projects to the secondary polytope. The universal polytope has much 
higher dimension than the secondary polytope. In fact, its ambient dimension is the number 
of possible bases of the configuration A, thus no more than ( . It has the advantage that 
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the number of simplices in different triangulations is given by the values of a certain linear 
functional t/j. 

More precisely, we think of the chambers inside cone(^4) as the vertices of the following 
high-dimensional 0-1 polytope: Assume A is a vector configuration of c vectors in M r . Let 
N be the number of r-dimensional simple cones in A. We define Ua as the convex hull in 
M. N of the set of incidence 0-1 vectors of all chambers of A. For a chamber T the incidence 
vector vt has coordinates (vt)a = 1 if the basis A G T and {vt)a = if ^4. is not a basis of 
T. The polytope Ua is the universal polytope defined in general by Billera, Filliman, and 
Sturmfels in [33j, although there it is defined in terms of the triangulations of the Gale 
transform of A. 

In |96j , it was shown that the vertices of the universal polytope Ua of A are exactly the 
integral points inside a polyhedron that has a simple inequality description in terms of the 
oriented matroid of A. (See [42j . [96] . or [290 for more on oriented matroids.) The concrete 
integer programming problems in question were solved using the CPLEX Linear Solver™. 
The program to generate the linear constraints is a small C++ program available on the web 
(see [100] ). 



Example 1.5.9. Let A be the 4x6 matrix ^2x3 from Example 1.5.4 Let S denote the set of 



all bases that can be defined in A. Then N = \S\ = (°) = ( 2 ) = 15. So the universal polytope 
Ua is defined in M 15 , where each coordinate is indexed by a 2-subset A of {1, . . . , 6}. Thus 
Ua is the convex hull in ~R N of the incidence vectors vt corresponding to the 18 chambers 



of A. By Lemma 1.5.5, this is equivalent to the convex hull of the incidence vectors vt of 
the 18 triangulations of A. For example, the triangulation T = {{1, 2}, {1, 3}, {2, 4}, {3, 4}} 



in Figure 1.23 gives the incidence vector vt = ^{1,2} + e {i,3} + e {2,4} + e {3,4} (where e 7 is the 
basis unit vector in the direction 7 J as one of the vectors of the convex hull. 

The convex hull of these 18 incidence vectors is a Q-dimensional 0-1 polytope Ua in M 15 . 
That the dimension is (at most ) six follows from the following considerations: 



• Since the pairs {1,4}, {2,5} and {3,6} are not full-dimensional and thus never 
appear as a simplex in any triangulation T of A, the polytope Ua is contained in 
the subspace £{1,4} = £{2,5} = £{3,6} = 0. 
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• Since the vector 1 has 2 and 6 on one side and 5 and 3 on the other, in every 
triangulation the sum Xr^} + £{1,6} equals the sum x^ i3 y + xn 5 \ (and it equals 
zero or one depending on whether the triangulation uses the vector 1 or not). This 
implies the hrst of the following equalities, the rest being the analogous statement 
for the other five vectors: 



X{1,2} 


+ £{1,6} 


" £{1,3} 


- £{1,5} = 





£{2,3} 


+ £{2,4} 


" £{1,2} 


- £{2,6} = 





£{1,3} 


+ X {3,5} 


" £{2,3} 


~ X {3,4} = 





X {3,4} 


+ £{4,5} 


- £{2,4} 


~~ £{4,6} = 





^{1,5} 


+ £{5,6} 


- £{3,5} 


- X{ 4>5 } = 





^{2,6} 


+ £{4,6} 


- £{1,6} 


- £{5,6} = 






Observe that one of these equations is redundant, since the sum of the left-hand 
sides is already zero. 

• Since every triangulation needs to cover the angle between, for example, vectors 1 



and 6, and this angle is covered only by the cones 16, 12 and 56 (see Figure 1.23), 
we have that 

£{1,6} +£{1,2} +£{5,6} = 1- 

The results in [96] say that the polytope Ua is the convex hull of the non-negative integer 
points in M 15 satisfying this list of equations. 

We now denote by i[) : M> N — > M the cost vector dehning the linear functional 

tp(x) = (1,1,. . ., 1) -X = ^ x A . 

AeS 

Then the values ip(x) of the linear functional ip on Ua H {0, 1}^ are the only possible values 
for the number of vertices of non-degenerate polytopes of the form Pj, = {x G M c | Ax = 
b, x > 0}. In particular, the solutions to the linear programming relaxations: "minimize 
(respectively maximize) ip(x) subject to x G Ub" give lower (respectively upper) bounds 
to the possible values for the number fo of vertices of non-degenerate polytopes of the 
form Pi, = {x G M c | Ax = b, x > 0}. In the running example, 3 < ip(x) < 6 whenever 
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x £ Uj^C\{0, 1} N . From Table A.l we observe that the number of vertices of a non-degenerate 



2x3 transportation polytope equals 3, 4, 5, or 6. 
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CHAPTER 2 

The Hirsch Conjecture Today 

The Hirsch Conjecture was posed in 1957 in a letter from Hirsch to Dantzig (see page 
168 of |88j). Besides its implications in linear programming, which motivated the conjecture 
in the first place, it is one of the most fundamental open questions in combinatorial geometry. 
It states that the graph of a d-dimensional polytope with n facets cannot have diameter 
greater than n — d. That is to say, we can go from any vertex of the polytope to any other 
vertex using at most n — d edges. 

Conjecture 2.0.1 (Hirsch Conjecture). Let n > d > 2. Let P be a d-dimensional polytope 
with n facets, and let G(P) be its graph. Then diam(G(P)) < n — d. 

Remark 2.0.2. See Appendix \C\ for a very recent update on the status of the conjecture. 
Santos (see |245j ) announced the construction of a counter-example to the Hirsch Conjecture 
as this dissertation was being submitted. 

Despite being one of the oldest and most basic problems in polyhedral combinatorics, 
what we know is quite scarce. Most notably, no polynomial upper bound is known for the 
diameters of polytopes. That is to say, the following Polynomial Diameter Conjecture is 
open: 

Conjecture 2.0.3 (Polynomial Diameter Conjecture). Is there a polynomial function f(n, d) 
such that for any polytope (or polyhedron) P of dimension d with n facets, diam(G(P)) < 
f(n,d)?. 

In contrast, very few polytopes are known where the bound n — d is attained. This 
chapter surveys the state of the art on the Hirsch Conjecture as of today. An earlier survey 
on the geometry of the Hirsch Conjecture was written by Klee and Kleinschmidt (see |181j ) 
in 1987. (On a related note, the surveys by Megiddo (see [213J) and Todd (see |275|) 
describe the complexity of linear programming algorithms.) This chapter is based on a new 
survey written jointly with Santos (see [178J ) . 



Any polytope or polyhedron P, given by its facet-description, can be perturbed to a 
simple one P' by a generic and small change in the coefficients of its defining inequalities. 
This will make non-simple vertices "explode" and become clusters of new vertices, all of 
which will be simple. This process cannot decrease the diameter of the graph, since we can 
recover the graph of P from that of P' by collapsing certain edges. 

Proposition 2.0.4. The diameter of P' is an upper bound on the diameter of P. 

Hence, to study the Hirsch Conjecture, one only needs to consider simple polytopes: 

Lemma 2.0.5. The diameter of any polytope P is bounded above by the diameter of some 
simple polytope P' with the same dimension and number of facets. 

Graphs of simple polytopes are better behaved than graphs of arbitrary polytopes. Their 
main property in the context of the Hirsch Conjecture is that if u and v are vertices joined 
by an edge in a simple polytope then there is a single facet containing u and not v, and a 
single facet containing v and not u. That is, at each step along the graph of P we enter a 
single facet and leave another one. 

Definition 2.0.6. Let H(n, d) denote the maximum diameter of graphs of d-polytopes with 
n facets. Let H u (n, d) denote the maximum diameter of graphs of d-polyhedra with n facets. 

Remark 2.0.7. The Hirsch Conjecture is, then, the assertion that H(n, d) < n — d for all 
n > d > 2. Since all polytopes are polyhedra, clearly H(n, d) < H u (n, d) for all pairs (n, d) 



with n > d > 2. By Lemma 2.0.5 the quantity H(n,d) is, equivalently, the maximum 



diameter of graphs among simple d-dimensional polytopes with n facets. By Remark 1.2.6, 
this is also the maximum diameter among the dual graphs of simplicial d-polytopes with n 
vertices. 

In the rest of this chapter, we survey the status of the Hirsch Conjecture today. In 



Section 2.1 we discuss the connection between the Hirsch Conjecture and linear programming. 

A polytope is called Hirsch-sharp if it meets the Hirsch Conjecture with equality. In 
other words, a d-dimensional polytope with n facets is Hirsch-sharp if its diameter is exactly 



n — d. Section pT2| discusses Hirsch-sharp polytopes. In Section 2.2.2| we describe a special 



Hirsch-sharp polytope first discovered by Klee and Walkup in their seminal 1967 paper |183| . 



In Section 2.2.3, we present useful operations which preserve Hirsch-sharpness. Section 2.2.4 



2.1. THE HIRSCH CONJECTURE AND LINEAR PROGRAMMING 



59 



surveys the work of Fritzsche, Holt, and Klee (see |133j . [159], and [160 ) which prove that 
Hirsch-sharp (i-polytopes with n facets exist whenever n > d > 7. 



Section 2.3 surveys results in support of the Hirsch Conjecture, or at least the weaker 



Polynomial Diameter Conjecture (see Conjecture 2.0.3). Section 2.3.1 surveys results of 
Klee and Walkup (see |183j ). proving the equivalence of the Hirsch Conjecture to two other 



natural conjectures, the d-step and the Non-revisiting Conjectures. Section 2.3.2 surveys 
the cases of small d or n — d, which are some cases where the Hirsch Conjecture is known 



to hold. Section 2.3.3 presents general upper bounds on the diameters of all polytopes. 



Section 2.3.4 discusses the recent work of Deza, Terlaky, and Zinchenko (see [109] , |110j . 
and |108j ) on a continuous analogue of the Hirsch Conjecture arising in the context of the 



central path method for linear programming and convex optimization. Section 2.3.5 surveys 
known diameter bounds for special classes of polytopes, including transportation polytopes. 



We close this chapter with Section 2.4, which surveys evidence against the Hirsch 
Conjecture (or the Polynomial Diameter Conjecture should the Hirsch Conjecture turn out 
to be false). This section proves that three natural variants of the Hirsch Conjecture are false. 



Section |2.4.1 presents the work of Klee and Walkup in |183j which shows the Unbounded 
Hirsch Conjecture is false. We also show Todd's result (see [274]) that the Monotone Hirsch 



Conjecture is false. Section 2.4.2 summarizes the work of Mani and Walkup in [208], where 
they prove that the Topological Hirsch Conjecture is false. 



2.1. The Hirsch Conjecture and linear programming 

The original motivation for the Hirsch Conjecture comes from its relation to the simplex 
algorithm for linear programming. The surveys by Megiddo (see |213| ) and Todd (see |275j ) 
describe the complexity of linear programming algorithms. In 1979, Khachiyan (see |174|) 
proved that linear programming problems can be solved in polynomial time on the input 
size of the polyhedron via the ellipsoid method for linear programming. In 1984, Karmarkar 
(see |172j ) devised the interior point method for linear programming. (See [241] for a 
complete treatment of interior point methods.) 

Although the latter is more applicable (it is easier to implement and has better complexity) 
than the former, to this day the most commonly used method for linear programming is 
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the simplex method, devised by Dantzig in 1947. Dongarra and Sullivan regard the simplex 
method among the top ten algorithms of the twentieth century (see |114j ). In geometric 
terms, the simplex method first finds an arbitrary vertex in the feasibility polyhedron. Then, 
it uses local rules to move from vertex to adjacent vertex in such a way that the value 
of the given linear functional £ increases at every step. When there is no such pivot step 
that can increase the functional, convexity implies that we have achieved the maximum 
possible value of it. (For a practical explanation of how to implement the simplex method, 
see the book [72J.) 

Bounding the diameter of graphs of polytopes has received a lot of attention because of 
its connection to the performance of the simplex method for linear programming. Clearly, a 
lower bound for the performance of the simplex method under any pivot rule is the diameter 
of the polyhedron P. The converse is not true, since knowing that P has a small graph 
diameter does not in principle tell us how to go from one vertex to another in a small 
number of steps. In particular, many of the results on diameters of polyhedra presented 



later (e.g., Theorems 2.3.9 and 2.3.23) do not have direct implications for the efficiency of 
the simplex method: The proofs construct a short path pairs of vertices only after specifying 
the coordinates of both vertices! 

Remark 2.1.1. More relevant in the context of the simplex method is the proof that there are 
"randomized" pivot rules that get to the optimum vertex in subexponential time. The exact 



2.3.14 



bound is e K ^ dlo ^ n , where K is a fixed constant (see [T68J and [211]). See Theorem 
Also interesting are results by Spielman and Teng (see [255JJ and improvements by Vershynin 
(sec [280JJ saying that "random polytopes" have polynomial diameter. More precisely: any 
polytope can be perturbed to have a diameter that is polynomial in the values of n, d, and 



the inverse of the perturbation parameter (see Theorem 2.3.15). This result seems to explain 
why the simplex method works well in practice. 

In fact, the complexity of the simplex method depends on a local rule (known as a 
pivot rule) chosen to move from vertex to vertex. (In the survey [272J, Terlaky and Zhang 
study the finiteness of the simplex algorithm under various pivot rules.) The a priori "best" 
pivot rule is "move to the neighbor where the functional increases most," but Klee and 
Minty (see |182| ) showed in 1972 that this can lead to paths of exponential length, even in 
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polytopes that are combinatorially equivalent to cubes. The same worst-case exponential 
behavior has been proved for nearly every deterministic rule devised so far, although not 
for all of them. However, there are subexponential, but not yet polynomial, randomized 



pivot algorithms (see Theorem 2.3.14). Still, the simplex algorithm is highly efficient in 
practice on most linear optimization problems. (See |37j for more on practical performance 
in solving linear programs.) 

There is another reason why investigating the complexity of the simplex method is 
important, even if we already know polynomial time algorithms. The algorithms of Khachiyan 
and Karmarkar are polynomial in the bit length of the input. (For a discussion of bit length, 
see Section 3.2 of |247j .) However, it would be interesting to know whether a polynomial 
algorithm for linear programming in the real number machine model of Blum, Cucker, Shub, 
and Smale (see [41] ) exists. That is to say, is there an algorithm that uses a polynomial 
number of arithmetic operations on the coefficients of the linear program, rather than on 
their bits; or, better yet, a strongly polynomial algorithm, i.e., one that is polynomial both in 
the arithmetic sense and the bit sense? These two related problems were included by Smale 
(see [251] ) in his list of mathematical problems for the next century. A polynomial pivot 
rule for the simplex method would solve them in the affirmative. 

In this context, the much weaker statement asserting the existence of a polynomial 
upper bound in n and d is relevant (see the Polynomial Diameter Conjecture, stated as 
Conjecture |2. 0.3 on page 57), if the linear one turns out to be false (see, e.g., the discussion 



in |169j). The best bound for all polytopes is a quasi-polynomial bound by Kalai and 



Kleitman in [170J. We present their result later as Theorem 2.3.9 



Remark 2.1.2. Many of the algorithms for linear programming discussed here make sense 
(with some modihcations) in the larger context of convex optimization. See, e.g., |48j, [241], [242] . 
and [2"5B] . 

2.2. Hirsch-sharp polytopes 



Recall that a o?-polytope (or polyhedron) with n facets is called Hirsch-sharp if it meets 
the Hirsch Conjecture with equality; that is, if its diameter is exactly n — d. Here we 
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show several ways to construct Hirsch-sharp polytopes. Our current knowledge is that 
Hirsch-sharp d-polytopes with n facets: 

• exist if one of the following three conditions holds: n < 2d, n < 3d — 3, or d > 7. 

• do not exist for d < 3 if n > 2d, or for (n, d) G {(10, 4), (11, 4), (12, 4)}. 

• are unknown, but may exist in all other cases: that is, if d G {4, 5, 6} and n > 3d — 3, 
except for the three pairs (n, 4) mentioned above. 

However, all Hirsch-sharp polytopes with n > 2d that are known are obtained from one 
particular Hirsch-sharp polytope, by simple geometric operations of wedging, truncating 



and gluing, as we show in Section 2.2.4 That is to say: to some extent, we only know 
one non-trivial Hirsch-sharp polytope, the four- dimensional polytope discovered by Klee 



and Walkup in their seminal 1967 paper |183j . In Section 2.2.2 we describe this particular 
Hirsch-sharp polytope. Our description gives integer coordinates for it much smaller than 
the original ones. 



2.2.1. Easy constructions 

Let us start with the most basic Hirsch-sharp polytopes. 

(1) Cubes. The d-dimensional cube has 2d facets. The distance between vertices 
is the Hamming distance (i.e., number of different coordinates); thus the diameter 
of Djj is d. 

(2) Products. One reason that the Hirsch Conjecture is natural is that it is "invariant" 
under taking products. If P and Q meet the Hirsch Conjecture with equality, then 
their Cartesian product P x Q does the same. Indeed, both the dimension and 
number of facets of P x Q are the sum of those of P and Q. The diameter of the 
product is the sum of the diameters: if we want to go from vertex (ui,vi) to vertex 
(u2, V2) we can do so by going from (ui, v±) to (u2, v 1) along P x {v{\ and then to 
(u2,t>2) along {1x2} x Q] and there is no better way. 

(3) Products of simplices. In particular, any product of simplices of any dimension 
satisfies the Hirsch Conjecture with equality. Let P = Aj x x • • • x Aj fc be the 
product of k simplices. The dimension of P is Y2j=i hi P nas Y2j=i(h + 1) facets, 
and its diameter is k. 
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Corollary 2.2.1. For every d < n < 2d there are simple d-polytopes with n facets 
and diameter n — d. 

Proof. Let k = n — d < d and let i\, . . . , be any partition of d into k positive 
integers (that is, i\ H + ik = d). Let P = x • • • x Aj fe . □ 

(4) Polyhedra with n < 2d. There is another way to construct Hirsch-sharp polytopes 
when d < n < 2d. Let k = n—d and u be the origin in R d . Let v = (1, . . . , 1, 0, . . . , 0) 
be the point whose first k coordinates are 1 and whose remaining d — k coordinates 
are 0. 

Consider the polytope P defined by the following d + k inequalities: 

xi > 0, Vz; 4>j{x) > 0, j = 1, . . . , k, 

where the ipi are affine linear functionals that vanish at v and are positive at u. No 
matter what choice we make for the ^-'s, as long as they are sufficiently generic to 
make P simple, P will have diameter (at least) k; to go from v to u we need to 
enter the k facets Xj = 0, j = 1, . . . , k, and each step gets you into at most one of 
them. 

In principle, P may be an unbounded polyhedron; but if one of the ipj's is, say, 
k — x ii then it will be bounded. 

So, Hirsch-sharp polytopes with n < 2d are easy to find. We consider them "trivial" 
Hirsch-sharp polytopes. In the following sections we show examples of "non-trivial" ones. 
But before that let us mention that unbounded Hirsch-sharp polyhedra with any number of 
facets are also easy to obtain: 

Proposition 2.2.2. For every n > d there are simple unbounded d-polyhedra with n facets 
and diameter n — d. 

Proof. The proof is by induction on n, the base case n = d being the orthant IR> = 
{x G R d | Xi > 0, Vi}. Our inductive hypothesis is not only that we have constructed a 
d-polyhedron P with n — 1 facets and diameter n — d — 1; also, that vertices u and v at 
distance n — d — I exist in it with v incident to some unbounded ray I. Let H be a supporting 
hyperplane of I, and tilt it slightly at a point v' in the interior of I to obtain a new hyperplane 
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H' . (See Figure [2~Tj ) Then, the polyhedron P' obtained cutting P with the tilted hyperplane 
has n facets and diameter n — d; v is the only vertex adjacent to v in the graph, so we need 
at least 1 + (n — d — 1) steps to go from v' to u. □ 




2.2.2. The Klee-Walkup polytope Q 4 

In their seminal 1967 paper |183j on the Hirsch Conjecture and related issues, Klee and 
Walkup describe a 4-dimensional polytope called with nine facets whose diameter is five. 
Innocent as this might look, this first "non-trivial" Hirsch-sharp polytope is at the basis of 



the construction of every remaining Hirsch-sharp polytope known to date (see Section 2.2.4). 
It is also instrumental in disproving the unbounded and monotone versions of the Hirsch 
Conjecture, which we will discuss in Section 2.4.1[ Moreover, its existence is something of 



an anomaly: Altshuler, Bokowski, and Steinberg (see [8]) list all 1296 combinatorial types of 
simplicial spheres with nine vertices. Of them, 1142 are polytopal (that is, are the boundary 
complex of some polytope) and the polar of is the only one that is Hirsch-sharp. 

We prefer to describe the polytope in the polar view (that is, we will describe Q^), 
where it is simplicial instead of simple and the diameter is considered for the polar graph 



G (Q4), which is we know by Part Ml) of Lemma 1.2.4 is isomorphic to the graph G{Qi) of 



Q4. So, let be the convex hull of the following nine points in M 4 . The coordinates here 
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are much smaller than the original ones in [183J: 

a ■= (-3,3,1,2), e := (3,3,-1,2), 

6:= (3,-3,1,2), /:= (-3,-3,-1,2), 

c:= (2,-1,1,3), g := (-1,-2, -1,3), 

d:= (-2,1,1,3), h:= (1,2,-1,3), 
w := (0,0,0,-2). 



The polar (simple) polytope = Qf A of (see Part §2y of Lemma 1.2.4) is obtained 
by converting each vertex v of the simplicial polytope into an inequality v ■ x < 1. For 
example, the inequality corresponding to vertex a below is — 3a; i + 3x2 + ^3 + 2^4 < 1. 
The key property of the polytope is: 

Theorem 2.2.3 (Klee-Walkup [183J). Any path in the simplicial polytope Qf from the 
tetrahedron abed to the tetrahedron efgh needs at least five steps. 

To prove this, you may simply input these coordinates into any software able to compute 
the (polar) graph of a polytope. Our suggestion for this is polymake (see [136 J). However, 
understanding this polytope can be the key to the construction of counter-examples to the 
Hirsch Conjecture, so it is worth presenting the following hybrid computer- human proof. 

Proof. When pivoting from the tetrahedron abed to the tetrahedron efgh we already 
need four steps to introduce, one by one, the four vertices e, /, g and h. Hence, all paths that 
use the extra vertex w have necessarily length five of bigger. This means we can concentrate 
on the subcomplex K of dQ^ consisting of tetrahedra that do not use the vertex w. This 
subcomplex is called the anti-star of w in dQf~. We claim that this subcomplex consists of 



the 15 tetrahedra in Figure 2.2 



Figure 2.2 shows adjacencies among tetrahedra; that is, it is the polar graph of K. 
Observe that abed and efgh are repeated in the figure, to better reflect symmetry. From 
the picture it is easy to conclude our statement: there is no tetrahedron that can be reached 
in two steps from both abed and efgh, so the diameter is at least five. □ 

From the picture we can also see which tetrahedra of dQ^ use the vertex w: there is 
one for each triangle that appears only once in the list. For example, since the tetrahedron 
abed is adjacent only to acde and bedf, the triangles abc and bed are joined to the vertex w. 
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abed 

acde 

I \ 

adeh — cdeh — bceh — begh — efgh 

adgh — edgh — begh 

efgh afgh — adfg cdfg befg 

\ I 
bedf 

abed 

Figure 2.2. The polar graph of the subcomplex K 
The boundary of the anti-star of the vertex w, that is, the link of w in turns out to be, 



combinatorially, the triangulation of the boundary of a cube displayed in Figure 2.3 The 







b / 


V 









Figure 2.3. The link of w in is combinatorially a triangulation of the 
boundary of a cube. 



anti-star K of w in the boundary dQf' of is a topological triangulation of the interior of 
the cube. But we need to deform the cube a bit to realize this triangulation geometrically. 



This is shown in Figure 2.4 the quadrilaterals abed and efgh are displayed separately as 
lying in two different horizontal planes (so that the two relevant tetrahedra abed and efgh 
degenerate to flat quadrilaterals), and the central part of the figure shows the intersection of 
the complex K with their bisecting plane. Tetrahedra with three points on one plane and 
one in the other appear as triangles and tetrahedra with two points on either side appear as 
quadrilaterals. The tetrahedra abed and efgh do not show up in the figure, since they do 
not intersect the intermediate plane. For the interested reader, this picture is an example of 
a mixed subdivision of the Minkowski sum of two polygons. The fact that triangulations 
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of polytopes with their vertices lying in two parallel hyperplanes can be pictured as mixed 
subdivisions is the polyhedral Cayley trick (see, e.g., |244j or Chapter 9 of [105J). 




Figure 2.4. The Klee-Walkup complex as a mixed subdivision. The shad- 
owed triangles represent tetrahedra adjacent to abed and efgh 

2.2.3. Wedge and one-point suspension 

Here we describe a very basic, yet extremely fruitful, operation that one can do to a polytope. 
In the simple version it is called wedging and its simplicial counterpart (i.e., in the polar 
setting) is the one-point suspension. We will prove that the Hirsch Conjecture is invariant 
under wedges and truncations. (To simplify our exposition, in this section we assume that 
all our polytopes are full-dimensional, i.e., the ambient dimension c is always equal to the 
dimension d.) 

Wedging 

Let F be a facet of a polytope P, and let f(x) < b be the inequality corresponding to F. 
The wedge of P over F is the polytope 

W F {P) :=Px [0, oo) n{(i,t)eR tl xK: f(x) +t<b}. 

Put differently, Wj?(P) is formed by intersecting the half-cylinder C := P x [0,oo) with a 
closed half-space J in such that: 

• the intersection J n C is bounded and has nonempty interior, and 

• the boundary hyperplane H := d J is such that H n C = F. 
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Lemma 2.2.4. Let P be a d-polytope with n facets. Let Wjr(P) be its wedge on a certain 
facet F. Then, W_f(P) has dimension d + 1, n + 1 facets, and 

diam(W F (P)) > diam(P). 

Proof. The wedge increases both the dimension and the number of facets by one. 
Indeed, the polytope Wj?(P) has a vertical facet projecting to each facet of the polytope 
P other than the facet F, plus the two facets that cut the cylinder PxR, and whose 



intersection projects to F. See Figure 2.5 for an example. 





Figure 2.5. A 5-gon and a wedge on its top facet 



For the diameter, since every edge of the polytope Wj?(P) projects either to an edge of 
P or to a vertex of P, the diameter of Wf(P) is at least that of P. □ 



In particular, if the polytope P is Hirsch-sharp, then the polytope W^(P) is either 
Hirsch-sharp or a counter-example to the Hirsch Conjecture. The properties that P would 



need for the latter to be the case will be made explicit in Remark 2.3.5 



One-point suspension 

It will be useful to consider the same operation in the polar setting, where it is known as the 
one-point suspension. We refer the reader to Section 4.2 of |105| for an expanded overview. 
Let w be a vertex of the polytope P. The one-point suspension of P C M d at the vertex w 
is the polytope 



S W (P) ■= conv ((P x {0}) U ({w} x {-1, +1})) C 



id+l 
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That is, S W (P) is formed by taking the convex hull of P (in an ambient space of one higher 



dimension) with a "raised" and "lowered" copy of the vertex w. See Figure 2.6 for an 
example. 




Figure 2.6. The simplicial version of Figure 2.5 a 5-gon and a one-point 
suspension on its topmost vertex 



Recasting Lemma 2.2.4 in the polar setting gives: 

Lemma 2.2.5. Let P be a d-polytope with n vertices. Let S W (P) be its one-point suspension 
on a certain vertex w. Then S W (P) is a {d + l)-dimensiona] polytope with n + 1 vertices, 
and the diameter of the polar graph G A (S W (P)) of S W (P) is at least the diameter of the 
polar graph of P. 

Since the wedge of a simple polytope produces a simple polytope, the one-point suspension 
of a simplicial polytope is a simplicial polytope. In fact, the one-point suspension can be 
described for abstract simplicial complexes: Let L be a simplicial complex and w a vertex of 
it. Recall that the anti-star asti^) of w is the subcomplex consisting of simplices not using 
w and the link Yk.L(w) of w is the subcomplex of simplices not using w but joined to w. If L 
is a piecewise-linear /c-sphere, then ast^(w) and lkk(u>) are a /c-ball and a (k — l)-sphere, 
respectively. The one-point suspension of L at w is the following complex: 



(2.1) 



S W (L) := (asti(tt;) * w±) U (asti / (u;) * W2) U (lk/^ii;) * wiu^)- 



Here * denotes the join operation: L * K has as simplices all joins of one simplex of L and 



one of K . In Figure 2.6 the three parts of the formula are the three triangles using w\ but 
not W2 , the three using W2 but not w\ , and the two using both, respectively. 

In the next section we will make use of an iterated one-point suspension. That is, in 
S W (P) we take the one-point suspension over one of the new vertices w\ and W2, then again 
in one of the new vertices created, and so on. We leave it to the reader to check that, at 
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the level of simplicial complexes, the one-point suspension iterated k times on the simplicial 
complex L produces the simplicial complex S W (L)^ below, where is a /c-simplex with 
vertices wi, . . . , Wk+i and dA k is its boundary: 

S W (L)W ■= (ast L H * 8A k ) U (\k L (w) * A fe ). 



Observe that this generalizes the formula for S^(L) presented above in (2.1). 
2.2.4. Many Hirsch-sharp polytopes 



In Section 2.2.2| we saw the first example of a non-trivial Hirsch-sharp polytope, the Klee- 



Walkup 4-polytope with 9 facets. In this section we see constructions of other Hirsch-sharp 
polytopes, which together prove the following: 

Theorem 2.2.6 ([1331 H601 H59j ). Hirsch-sharp d-polytopes with n facets exist in at least 
the following cases: 



(1) n < 2d; 

(2) n < 3d — 3; and 

(3) d > 7. 



The "trivial" case n < 2d was shown in Section |2.2.1[ The case n < 3d — 3 was first 
proved in 1998 in |159j . together with the case d > 14. The latter was then improved 
to d > 8 in |133j . and to d > 7 in |160j . We also know that Hirsch-sharp polytopes of 



dimensions two and three exist only when n < 2d (see Section 2.3.2). But the existence of 
Hirsch-sharp polytopes with many facets in dimensions four to six remains open. 

Open Problem 2.2.7. Are there Hirsch-sharp A-polytopes with 13 or more facets? Are 
there Hirsch-sharp 5-polytopes with 13 or more facets? Are there Hirsch-sharp 6-polytopes 
with 16 or more facets? 

We do know that they do not exist in dimension four with 10, 11 or 12 facets, since 
Goodey (see [142j ). Schuchert (see 248 ). and Bremner et al. (sec |49j) proved that 
#(10, 4) = 5, #(11,4) = 6 and #(12,4) = 7. (Recall that H(n,d) is the maximum diameter 
among ci-polytopes with n facets.) 
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Part ([2]) of Theorem 2.2.6 follows from the iterated application of the next lemma to the 



Klee-Walkup polytope Q4. 

Lemma 2.2.8 (Holt-Klcc [159J). If there are Hirsch-sharp d-polytopes with n > 2d facets, 
then there are also Hirsch-sharp (d + l)-polytopes with n + 1, n + 2, and n + 3 facets. 

Proof. Let u and v be vertices at distance n — d in a simple d-polytope with n facets. 
Let F be a facet not containing any of them, which exists since n > 2d. When we wedge on 
F we get two edges U1U2 and v\V2 with the properties that the distance from any m to any 
Vi is again (at least) d. We can then truncate one or both of u± and v\ to obtain one or two 



more facets in a polytope that is still Hirsch-sharp. See Figure 2.7 □ 





Figure 2.7. After wedging in a Hirsch-sharp polytope we can truncate twice 



The next construction is more natural in the simplicial framework. So, as a warm-up, 



we include (see Figure 2.8) the simplicial version of Figure 2.7 We already said that the 
polar of wedging is one-point suspension. The polar of truncation of a vertex is the stellar 
subdivision of a facet by adding to our polytope a new vertex very close to that facet. 

Remark 2.2.9. The stellar subdivision is also known as stacking. See, e.g., page 621 in [215j . 





Figure 2.8. The simplicial version of Figure 2.7 Wedging becomes one-point 
suspension and truncation is stellar subdivision 



The key property in the proof of Lemma 2.2.8 is that the wedge and one-point suspension 
operations do not only preserve Hirsch-sharpness; they also increase the number of vertices 
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or facets (respectively) that are at Hirsch distance from one another. This suggests looking 
at what happens when we iterate the process. 

Lemma 2.2.10 (Holt-Klee [159J). Let P be a simplicial d-polytope with more than 2d 
vertices. Let A and B be two facets of it at distance 5 in the polar graph and let w be a 
vertex not contained in any of them. Let P^ be the kth one-point suspension of P on the 
vertex w. 

Then, P^ has two (k + l)-tuples of facets {Ai, . . . , Ak+i} and {B\, . . . , B k +i} with 
every Ai at distance from 5 every B>i . All the facets in each tuple are adjacent to one another. 



Proof. We use the following formula, from Section 2.2.3, for the iterated one-point 
suspension of the simplicial complex L = dP: 

S w (L) (k) := (astz» * dA k ) U (lk L (w) * A k ). 

Here is a /c-simplex. The two groups of facets in the statement are A * dA k and B * dA k . 
The details are left to the interested reader. □ 

This is the basis for the following result of Fritzsche and Holt in [133] . From this, we 
can get the same for every d > 8 via wedging. The result has been improved to dimension 
d = 7 by Holt (see |160j ) with a generalization of these same arguments, but we skip this 



part since it is a bit more technical. That proves part 3 of Theorem 2.2.6 



Corollary 2.2.11 (Fritzsche- Holt |133j ). There are Hirsch-sharp 8-polytopes with any 
number of facets. 

Proof. We look at a new operation on polytopes. We call it gluing and it is simply a 
combinatorial/geometric version of the connected sum of topological manifolds (see Chapter 3 
of [154| ). Let P\ and P2 be two simplicial d-polytopes and let F\ and F2 be respective facets. 
The manifolds are dP\ and SP2 (two (d — l)-spheres); from them we remove the interiors 



of F\ and F2 after which we glue their boundaries. See Figure 2.9, where the operation 
is performed on two facets of the same polytope. On the top part we glue the polytopes 
"as they come," which does not preserve convexity. But if projective transformations are 
made on Pi and P2 that send points that are close to F\ and F2 to infinity, then the gluing 
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preserves convexity, so it yields a polytope that we denote P\^P<i- This is shown on the 



bottom part of Figure 2.9 




Figure 2.9. Gluing two simplicial polytopes along one facet. In the version 
of the bottom, a projective transformation is done to Pi and P2 before gluing, 
to guarantee convexity of the outcome 



Gluing almost adds the diameters of the two original polytopes. Suppose that the facets 
F\ and F2 are at distances 5\ and 82 to certain facets F[ and F' 2 of Pi and P%- Then, to go 
from F{ to F' 2 in Pi#p2 we need at least (8\ — 1) + 1 + [8% — 1) = <?>i + £2 — 1 steps. 

But we can do better if we combine gluing with the iterated one-point suspension. 
Consider the simplicial Klee-Walkup 4-polytope Qf~ described in Section 



2.2.2 



and let A and 



B two facets of it at distance five. Let P' be the fourth one-point suspension of it on the 
vertex w not contained in AL)B. Observe that the polytope P' has 13 vertices and dimension 
eight. By the lemma, P' has two groups of five facets, {Ai, . . . , A§} and {Pi, . . . , P5}, with 
every Ai at Hirsch distance from every Pj and all the facets in each group adjacent to one 
another. 

We now glue several copies of P' to one another, a Pj from each copy glued to an Ai of 
the next one. Each gluing adds five vertices and, in principle, four to the diameter. But 



Lemma 2.2.10 implies the following nice property for P': half of the eight facets adjacent to 
each Ai are at distance four to half of the facets adjacent to each Pj. Using the language of 
Fritzsche, Holt, and Klee (in |133j . [T59j. and [160]), we call those facets the slow neighbors 
of each A4 or Pj, and call the others fast. Since half of the total neighbors are slow, we 
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can make all gluings so that every fast neighbor is glued to a slow one and vice-versa. This 
increases the diameter by one at every gluing, and the result is Hirsch-sharp. 

The above construction yields Hirsch-sharp polytopes of dimension eight with n = 13 + 5A; 
vertices, for every k > 0. We can get the intermediate values of n too, via truncation. By 



Lemma 2.2.8, every time we do a one-point suspension on a Hirsch-sharp simplicial polytope 
we can increase the number of facets by one or two via a stellar subdivision at each end. 
Since the polytope P' we are gluing is a four-fold one-point suspension, and since there are 
two ends that remain unglued (the A- face of the first copy and the U-face of the last) we 
can do up to eight stellar subdivisions to it and still preserve Hirsch-sharpness. □ 



Remark 2.2.12. Holt, Klee, and Fritzsche prove these results via simple polytopes, rather 
than simplicial. The analogue of "gluing along a simplicial facet" is called "blending at 
a simple vertex" in [133J, [159], and [160]. The trick in the proof is called "fast-slow 
blending. " 



2.3. Positive results 



In this section, we give evidence for the Hirsch Conjecture, or at least its polynomial version, 
by presenting special cases for which it holds, three upper bounds for diameters of polytopes 
(one after perturbation), and its equivalence to two other seemingly natural conjectures. We 
also summarize some results on a continuous analogue of the conjecture. 



2.3.1. The d-step and Non-revisiting Conjectures 

The equivalence of the Hirsch Conjecture to other two conjectures that we discuss in this 
section shows that the conjectured bound of n — d is a natural bound to consider. 

We start with the d-step Conjecture. This is simply the Hirsch Conjecture restricted to 
polytopes whose number of facets is twice their dimension: 

Conjecture 2.3.1 (The d-step Conjecture). For every d>2, H(2d,d) < d. 

At first, it is seemingly just a special case of the Hirsch Conjecture. Its equivalence to 
the full Hirsch Conjecture follows from: 
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Theorem 2.3.2 (Klee-Walkup [183J). Let k be fixed. Then, 



xaaxH(k + d,d) = H(2k,k). 



In particular, the Hirsch Conjecture (Conjecture 2.0.1) holds if and only if the d-step 



Conjecture (Conjecture 2.3.1) holds 



Proof. We are going to show that: 



(2.2) 

and 

(2.3) 



< H(2k + 2,k + 2)< H(2k + l,k + l)< H(2k, k) 



< H(2k - 2, k - 2) < H(2k - 1, k - 1) < H(2k, k) < H(2k + l,k + l)< 



In (2.2), we are looking at polytopes with n < 2d. In (2.3), n and d are arbitrary. 



To prove (2.2), let P be a polytope with n < 2d and let u and v be vertices of it. Then 



u and v have some facet F in common, since each vertex is incident to at least d facets. The 
facet F has dimension d — 1, and each facet of it is the intersection of F with another facet 



of P. Hence, F has at most n — 1 facets. Since every path on F is a path on P, we get (2.2). 



For (2.3), we utilize the wedge operation. As we said in Lemma 2.2.4 this operation 



applied to P increases the dimension and number of facets by one, and it gives a polytope 
with the same or bigger diameter. □ 

One can interpret the Hirsch Conjecture as saying that if one wishes to go from vertex u 
to vertex v of a polytope P, one does not expect to have to enter and leave the same facet 
several times. This suggests the following conjecture: 

Conjecture 2.3.3 (The Non-revisiting Conjecture). Let P be a simple polytope. Let u 
and v be two arbitrary vertices of P. Then, there is a path from u to v which at every step 
enters a different facet of P. 

The Non-revisiting Conjecture asserts that for every two vertices u and v of a polytope P, 
there is a path in the graph of P that never revisits a facet that it has previously abandoned. 
Paths with the conjectured property are called non-revisiting paths. These paths are also 



called W v -paths and Conjecture 2.3.3 is also known as the W v Conjecture. The length of 
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non- revisiting paths is bounded by n — d: at each step we must enter a different facet, 
and the d facets that our initial vertex lies in cannot be among them. Klee and Walkup 
(sec [183 ) proved the Non-revisiting Conjecture is equivalent to the Hirsch Conjecture. We 
outline the proof, beginning with the following lemma: 

Lemma 2.3.4. Let n denote the number of facets of a polytope P and let d denote its 
dimension. The following properties are equivalent: 

(1) Every simple polytope P has the non-revisiting property. 

(2) Every simple polytope P with n = 2d has the non-revisiting property. 

Proof. One direction is obvious. For the other, let P be a polytope with n ^ 2d and 
suppose it does not have the non-revisiting property. That is, there are vertices u and v 
such that every path from u to v revisits some facet that it previously abandons. We will 
construct another polytope P' without the non-revisiting property and with: 

• One less facet and dimension than P if n < 2d, and 

• One more facet and dimension than P if n > 2d. 



F 




Figure 2.10. The pentagon P and the wedge P' = Wj?(P) over its facet F: 
the upper pentagonal facet of P' is F\ and the lower pentagonal facet is F2. 



In the first case, u and v lie in a common facet F and we simply let P' = F. 
In the second case, let F be a facet not containing u nor v and let P' = Wjr(P) be the 
wedge over F. Let F\ and F2 be the two facets of P' whose intersection projects to F (see 



Figure 2.10). Let u\ and V2 be the vertices of P' that project to u and v and lie, respectively, 
on F\ and F2. Now, consider a path from u\ to 1)2 on P' and project it to a path from u to 
v on P: 

• If the path on P revisits a facet (call it G) other than F, then the path on P' 
revisits the facet that projects to the facet G. 



2.3. POSITIVE RESULTS 



77 



• If the path on P revisits F, then the path on P' revisits either Fx or Fi- 
In either case, the polytope P' does not have the non- revisiting property. □ 



Remark 2.3.5. In the proof of Lemma 2.2.4 we noted that, applied to a Hirsch-sharp 



polytope P the wedge operator Wf produced either another Hirsch-sharp polytope or a 
counter-example to the Hirsch Conjecture. The last proof shows that the latter can happen 
only if P does not have the non-revisiting property. 



Theorem 2.3.6 (Klee-Walkup [183J). The Non-revisiting Conjecture (Conjecture 2.3.3) 



holds if and only if the d-step Conjecture (Conjecture 2.3.1) holds. 



Proof. Clearly, if P has the non-revisiting property, then it satisfies the d-step Con- 
jecture. To prove the converse, we assume without loss of generality that P is a simple 
d-polytope with n facets, where n = 2d. We also assume, by induction, that all polytopes 
with number of facets minus dimension smaller than d have the non-revisiting property. 

Let u and v be two vertices of P. We argue by induction on the number of common 
facets of u and v. In the base case where u and v do not share any facet, any path of length 
n — d = d is non-revisiting. For the inductive step, assume that u and v are vertices of a 
facet F of P: 

• If the facet F has less than n—1 facets itself, then F has the non-revisiting property 
by induction on "number of facets minus dimension," and we are done. 

• If F has n — 1 facets, since it has dimension d — 1 there is a facet F' of F not 
containing u nor v. Let P' = Wf>(F). Let u\ and vi be vertices of P' projecting to 
vertices u and v of P such that Fx contains u\ and F<i contains V2- As in the proof 



of Lemma 2.3.4, Fx and F2 denote the non-vertical facets of the wedge P' . Then P' 
has dimension d and 2d facets, but u' and v' have one less facet in common than u 
and v had. By the inductive hypothesis, there is a non-revisiting path from v! to v' 
in the polytope P'. When this path is projected to F, it retains the non-revisiting 
property, and it is also a non-revisiting path on the original polytope P. 

Thus, a d-dimensional polytope with 2d facets satisfying the <i-step Conjecture is a polytope 
that has the non-revisiting property. □ 
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2.3.2. Small dimension or few facets 

Recall that H(n, d) denotes the maximum diameter among the graphs of d- dimensional 
polytopes with n facets. An exact formula for H(n, d) when d = 3 was first proved by Klee 
in [HO]. 

Theorem 2.3.7 (Klee [T80] V H(n,S) = [?§\ - 1. 

Proof. We first show that H(n, 3) < |_xJ ~~ Let be a simple 3-polytope with n 
facets. By Euler's formula, P has 2n — 4 vertices. Let v and v 1 be two distinct vertices of 



P. By Balinski's Theorem (see Theorem 1.1.3 which Balinski proved in [17J), the graphs 



of 3-polytopes are 3-connected. Thus, there are three disjoint paths that go from v to v' . 
These paths use at most 2n — 6 intermediate vertices. Therefore, the shortest of these paths 
goes through at most |_-jf J — 2 vertices, so it has at most J — 1 edges. 



Figure 2.11. A family of simplicial 3-dimensional polytopes with 3 + 3k 
vertices and diameter 2k + 1. Two steps are needed to cross each layer of six 
triangles. 



To prove H (n, 3) > [^J — 1, we first consider the case where n is a multiple of three. 
We construct a family of simplicial 3-polytopes with 3 + 3k vertices with diameter 2k + 1 for 



every k > 0. The diagram of Figure 2.11 shows the case k = 2. Starting from the triangle 
depicted in the center, one needs a minimum of three steps to reach a facet outside of the 
next equilateral triangle, and two more for each extra layer of three vertices and six triangles. 



The case of 4 + 3k (where the bound does not increase) is obtained by any stellar 
subdivision of the picture in the previous case. For 5 + 3A; we subdivide the inner triangle of 
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A 



Figure 2.12. Modification needed to the inner triangle of Figure 2.11 for 
5 + 3k vertices 



Figure 2.11 with two more vertices, as depicted in Figure 2.12, which makes one more step 
necessary to exit that triangle from any of the two shaded triangles. □ 



The Hirsch Conjecture is also known to hold when P does not have too many facets. By 



Theorem 2.3.2, if we fix k, the Hirsch Conjecture for d-polytopes with d + k facets follows 
from the conjecture for fc-polytopes with 2k facets. Hence, the case n — d < 3 follows from 



Theorem 2.3.7 The cases when n — d < 5 were proved by Klee and Walkup in [183J. The 
case n — d = 6 was recently proved by Bremner and Schewe in |50j . 

Theorem 2.3.8 ([183], [50]). H(n,d) <n-d when n-d<6. 



It is worth noting that this and Theorem 2.3.7 exhaust all the pairs (n,d) where the 



Hirsch Conjecture is known to hold. In particular, the Hirsch Conjecture is open for 
4-polytopes with 13 facets. 

In summary, the pairs (n, d) for which the Hirsch Conjecture is known to be true is: 



(1) d < 3, and n arbitrary; 



(2) n < d + 6 and d arbitrary. This follows from Theorem 2.3.2 by showing H(8, 4) = 4, 



#(10, 5) = 5 and H(12, 6) = 6. The first two were proved by Klee and Walkup 
in [183]. The third was proved by Bremner and Schewe in [50j . 
(3) -ff(ll,4) = 6 was proved by Schuchert (see [248J). Following up on [50], Bremner, 
Deza, Hua, and Schewe (see [49]) proved #(12,4) = JT(12,5) = #(13,6) = 7. 



Combining this with the information on Hirsch-sharp polytopes we can give a "plot" of 



the function H(n, d) — (n — d) in Table 2.3.2 The horizontal coordinate is n — 2d, so that 
the column marked "0" corresponds to the polytopes relevant to the cf-step Conjecture. 
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Table 2.1. -£f(n, d) versus n — d, the state of the art 



2.3.3. General upper bounds on diameters 

The best upper bound known for the diameter of arbitrary polytopes is given in [TTOj by 
Kalai and Kleitman. This subexponential bound holds even for unbounded polyhedra. The 
proof is so short (it is only half a page!) that we reproduce it here: 

Theorem 2.3.9 (Kalai-Kleitman |170| ). Every polyhedron of dimension d and with n facets 
has diameter bounded above by n log2 ^ +2 = n 2 d log2 ' n . 

Proof. Let P be a d-dimensional polyhedron with n facets, and let v and u be two 
vertices of P. Let k v (respectively k u ) be the maximal positive number such that the union 
of all vertices in all paths in G(P) starting from v (respectively u) of length at most k v 
(respectively k u ) are incident to at most ^ facets. Clearly, there is a facet F of P so that we 
can reach F by a path of length k v + 1 from v and a path of length k u + 1 from u. 

Recall that H u (n,d) denotes the maximum diameter among all d-dimensional polyhedra 
with n facets. We claim that k v < J > d). (Of course, this implies that the same bound 

holds for k u .) 

To prove this, let Q be the polyhedron defined by taking only the inequalities of P 
corresponding to facets that can be reached from v by a path of length at most k v . By 
construction, all vertices of P at distance at most k v from v are also vertices in Q, and 
vice-versa. In particular, if w is a vertex of P whose distance from v is k v then its distance 
from v in Q is also k v . Since Q has at most n/2 facets, we get k v < H u ([^\,d). 
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The claim implies the following recursive formula for H u : 



H u (n,d) < 2H U 



d) +HJn,d- l) + 2, 



which we can rewrite as 



H u (n, d) + 1 < H u ([±\ ,d) + l H u (n,d + 1 



n 



n/2 



■n 



This suggests calling h(k, d) := (H(2 k , d) — l)/2 k and applying the recursion with n = 2 k , 



to get: 



h(k, d) < h(k -l,d) + h(k, d-l). 



This implies h(k,d) < ( k ~j i d ), or 



.//,, (2'' . <l) < *( k + d 



From this the statement follows if we assume n < 2 d (that is, k < d). For n > 2 d we use 
Larman's bound H u (n, d) < n2 d < n 2 , proved below. □ 



The diameters of polytopes of a fixed dimension d admit a linear bound. An upper 
bound of n3 d ~ 3 was found by Barnette in 1967 (see |23j ). Larman (see |197| ) then improved 
the bound to n2 d ~ 3 . The proof presented here is taken from a very recent paper (see |122] ) 
by Eisenbrand, Hahnle, Razborov, and RothvoB. As in the previous result, we prove it for 
all polyhedra, not just bounded polytopes. 

Theorem 2.3.10 (Larman |197j ). For every n > d > 3, the maximal diameter H u (n, d) of 
d- dimensional polyhedra with n facets is no more than n2 d ~ 3 . 



Proof. The proof is by induction on d. The base case d = 3 was Theorem 2.3.7 

Let u be an initial vertex of our polytope P, of dimension d > 3. For each other vertex 

v £ vert(P) we consider its distance dist(«i, v), and use it to construct a sequence of facets 

Fi, . . . , of P as follows: 

• Let F\ be a facet that reaches "farthest from u" among those containing u. That 
is, let <5i be the maximum distance to u of a vertex sharing a facet with u, and let 
F\ be that facet. 
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• Let 82 be the maximum distance to u of a vertex sharing a facet with some vertex 
at distance 5% + l from u, and let Fi be that facet. 

• Similarly, while there are vertices at distance 8% + 1 from u, let 8%+t be the maximum 
distance to u of a vertex sharing a facet with some vertex at distance Si + 1 from u, 
and let Fi+i be that facet. 

We now stratify the vertices of P according to the distances 81,62, ■■■ ,8k so obtained. 
Observe that 8k is the diameter of P. By convention, we let 8q = — 1: 

Vi := {v G vert(P) : dist(u,u) € (<5;_i, <5j]}. 

We call a facet F of P active in Vi if it contains a vertex of Vi. The crucial property that 
our stratification has is that no facet of P is active in more than two V^s. Indeed, each 
facet is active only in V^s with consecutive values of i, but a facet intersecting Vi, Vi+i and 
Vi+2 would contradict the choice of the facet Fi+\. In particular, if rii denotes the number 
of facets active in Vi we have 

k 

rii < 2n. 

i=l 

Since each Fi has vertices with distances to u ranging from at least <5j_i + 1 to 8i, we have 
that diam(i ? j) > 8i — 8i—\ — 1. Even more, let Qj, i = 1, . . . , k be the polyhedron obtained 
by removing from the facet-definition of Fi the equations of facets of P that are not active in 
Vi (which may exist since Fi may have vertices in Vi-i). By an argument similar to the one 
used for the polyhedron Q of the previous proof, Qi has still diameter at least Si — <5j_i — 1. 
But, by inductive hypothesis, we also have that the diameter of Qi is at most 2 d_4 (nj — 1), 
since it has dimension d — 1 and at most — 1 facets. Putting all this together we get the 
following bound for the diameter 8k of P: 

k 

h = - - 1) + (fc - 1) 

i=i 
k 

< ^2 d - 4 (n i -l) + fc 

i=l 

= 2 d ~ 4 ^rii- k(2 d ~ 4 - 1) < 2 d ^n. 

i 
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□ 



Remark 2.3.11. It has been pointed out repeatedly that the proofs of Theorems 2.3.9 



and 2.3.10 use only very limited properties of graphs of polytopes. For example, Klee and 



Kleinschmidt (see Section 7.7 in [181] ) show that Theorem 2.3.10 holds for the ridge-graphs 
of all pure simplicial complexes, and more general objects. Recently, Eisenbrand et al. [122J 
have shown the following similar result: 

Theorem 2.3.12 (Eisenbrand et al. [122]). Let G be a graph whose vertices are certain 
subsets of size d of the n-element set {1, . . . , n}. Assume that for every pair of vertices u 
and v in G there is a path from u to v using only vertices that contain uDv. Then: 

(1) diam(G) < n 1+lo ^ d . 

(2) diam(G) < n2 d - 1 . 

Remark 2.3.13. Kalai remarks that the first bound n 1+logrf holds for unbounded polyhedra. 



The novelty in [122J is that there are graphs with the hypotheses of Theorem 2.3.12 



and with diam(G) > cre 3//2 , for a certain constant c. It is not clear whether this is support 
against the Hirsch Conjecture or it simply indicates that the arguments in the proofs of 



Theorems 2.3.9 and 2.3.10 do not take advantage of additional properties that graphs of 
polytopes have and which may prevent their diameters from growing. For example, in 
Eisenbrand et al.'s setting for d = 1, any connected graph with vertex set {1, . . . , n} is valid. 
(Slightly different formulations for graphs of abstract polytopes are studied in, e.g., [2J, [3J, 
and [198].) 

Since the Hirsch Conjecture is strongly motivated by the simplex algorithm of linear 
programming, it is natural to ask questions about the number of iterations needed under 
particular pivot rules. Kalai, and independently, Matousek, Sharir and Welzl (see [168J 
and [211] . respectively) proved the existence of randomized pivot rules for the simplex 
method with subexponential running time. 

Theorem 2.3.14 (Kalai [168] : Matousek, Sharir, and Welzl [211] ). There exist randomized 
algorithms where the expected number of arithmetic operations needed in the worst case by 
a linear programming problem with d variables and n inequalities is at most exp(K^dlogn), 
where K is a fixed constant. 
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Also, though polynomial bounds on the diameter of all polytopes are unknown, there 
are results saying that random polytopes have polynomial diameter. One way to formalize 
this is by considering perturbations of the facet-defining inequalities of the given polytope. 
Specifically, let P be the feasibility polyhedron 

P = {x e R d | (a i: x) < b, (i = 1, . . . , n)} 

of a certain linear program. Then P has dimension d and (at most) n facets. If we replace 
the vectors dj £ M. d and b G M n with independent Gaussian random vectors with means 
Hi = en and fi = b (respectively), and standard deviations cr(niaxj \\(fii, we say that we 
have perturbed randomly within a parameter a. In [255 , Spielman and Teng proved that the 
expected diameter of a linear program that is perturbed within a parameter a is polynomial 
in d, n, and cr _1 . In [280J, Vershynin improved the bound to be polylogarithmic in the 
number n of inequality constraints. 

Theorem 2.3.15 (Vershynin [280j). If the original linear program is perturbed randomly 
within a parameter a, the expected diameter is 0(d 9 <7~ 4 log 4 d log 2 n). 

It is worth noticing that this result is not only structural. Its proof shows that the 
simplex method can find a path of that length in the perturbed polyhedron. 

2.3.4. A continuous Hirsch Conjecture 

Here we summarize some recent work of Deza, Terlaky, and Zinchenko (see [109j . |110j . 
and [108J) in which they propose interesting continuous analogues of the Hirsch Conjecture 
and the d-step Conjecture. The analogy comes from looking at the central path method 
for linear and convex optimization. As in the simplex method, the idea is to move from 
a feasible point to another feasible point on which the given objective linear functional 
is improved. In contrast to the simplex method, where the path travels from vertex to 
neighboring vertex along the graph of the feasibility polyhedron P, this method follows a 
certain curve through the strict interior of the polytope. 

Inspired by work of Dedieu, Malajovich, and Shub (see |106j ). Deza et al. consider the 
total curvature of a central path on a polytope to be an analogue of the graph-diameter, and 
show lower bounds that can be interpreted as "continuous Hirsch-sharpness," as well as a 
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result analogue to the equivalence between the Hirsch Conjecture and the d-step Conjecture. 
More precisely, to each linear program, 

Minimize subject to Bx = a and x > 0, 

the method associates a (primal) central path 7^ : [0,/3) — > M. d which is an analytic curve 
through the interior of the feasible region and such that 7^(0) is an optimal solution of 
the problem. The central path is well-defined and unique even if the program has more 
than one optimal solution, but its definition is implicit, so that there is no direct way of 
computing 7^(0). To get to 7^(0), one starts at any feasible solution and tries to follow a 
curve that approaches more and more the central path, using for it certain barrier functions. 
(Barrier functions play a role similar to the choice of pivot rule in the simplex method. 
The standard barrier function is the logarithmic function f(x) = — Y27=i m {AiX — hi).) For 
further description of the method, refer to |48j and |241j . 

In practice, one does not follow the curve exactly. Rather, one does Newton-like iteration 
steps trying not to get too far from the actual curve. How much can one improve in a single 
step is related to the curvature of the central path, if the path is rather straight one can do 
long steps without deviating too far from it. Otherwise, one needs to use shorter steps. Thus, 
the total curvature A^(P) of the central path, defined in the usual differential-geometric way, 
is an important parameter which can be considered a continuous analogue of the diameter of 
the polytope P, or at least of the maximum distance from any vertex to a vertex maximizing 
the functional £. 



Theorem 2.3.20 below yields an upper bound that is in 0(n d ) for A(n,d), but it had 
been conjectured that A^(P) is bounded by a constant for each dimension d and that it 
grows at most linearly with varying d. Deza et al. have disproved both statements: in |109j . 
they constructed polytopes for which \^(P) grows exponentially with d. More strongly, 
in |108 ] they construct a family of polytopes that show that the total curvature A^ cannot 
be bounded only in terms of d. To state this result, let A(n, d) denote the largest total 
curvature of the central path over all polytopes P defined by n inequalities in dimension d 
and over all linear objectives £. They show: 

Theorem 2.3.16 ([108]). For every fixed dimension d>2, hminf n _>.oo > 7r. 
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Deza et al. consider this result a continuous analogue of the existence of Hirsch-sharp 
polytopes. Motivated by this they pose the following conjecture: 

Conjecture 2.3.17 (Continuous Hirsch Conjecture). A(n, d) £ 0(n). That is, there is a 
constant K such that A(n, d) < Kn for all n and d. 



Theorem 2.3.16 says that if the Continuous Hirsch Conjecture is true, then it is (asymp- 
totically) tight. Deza et al. also conjecture a continuous version of the d-step Conjecture, and 
show it to be equivalent to the Continuous Hirsch Conjecture, thus providing an analogue of 



Theorem 12.3.21 

Conjecture 2.3.18 (Continuous ci-step Conjecture). The function A(2d,d) grows linearly 
in its input. That is to say, A(2d, d) is 0(d). 

Theorem 2.3.19 ( [110j ). The Continuous Hirsch Conjecture is equivalent to the Continuous 
d-step Conjecture. That is, if A(d, 2d) G 0(d) for all d, then A(n, d) E 0(n) for all d and n. 

The curvature of the central path had also been studied by Dedieu, Malajovich, and 
Shub in |106j . But instead of looking at a single polytope, they consider the average total 
curvature of the central paths of all bounded cells in a simple arrangement. 

An arrangement A of n hyperplanes in dimension d is called simple if every n hyperplanes 
intersect at a unique point. (Any sufficiently generic collection of n > d + 1 hyperplanes 
in d-dimensional space is simple.) It is easy to show that any simple arrangement of n 
hyperplanes in M d has exactly a = ("7 ) bounded full-dimensional cells. For a simple 
arrangement A with bounded cells P\, . . . ,P a and a given objective function £, Dedieu et al. 
define: 

h{A) = -j^\c(Pi). 

a £ — » 

i=i 

They prove: 

Theorem 2.3.20 ( |106| ). X^(A) < 2nd, for every simple arrangement. 

That is, even if individual cells can give total curvature linear in n (and perhaps worse) 



by Theorem 2.3.16, the average over all cells of a given arrangement is bounded by a function 
of d alone. By analogy, Deza et al. (see [J08J) consider the average diameter of the graphs 
of all bounded cells in a simple arrangement A. Denote it 5(A) and let H_^(n,d) be the 
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maximum of 6(A) over all simple arrangements denned by n hyperplanes in dimension d. 
They conjecture that: 

Conjecture 2.3.21 (Hyperplane Diameter Conjecture). H^(n,d) < d. 

They notice that this inequality, modulo a linear factor, would follow from the Hirsch 
Conjecture: 

PROPOSITION 2.3.22. If the Hirsch Conjecture holds, then Hj^(n,d) < d + 

Proof. Let {Pi \ i £ 1} be the collection of bounded cells of A. Let m < n denote 
the number of facets of the bounded cell P{. If the Hirsch Conjecture holds, we have 
6 (P%) < Tii — d, which implies 6(A) < ^ = n ' — n. Since a facet belongs to at 

most two cells, Yli=i n i * s ^ ess than twice the number of bounded facets of A. Now since a 
bounded facet contained in a hyperplane H of the arrangement A corresponds to a bounded 

cell of the simple arrangement Af\ H of one dimension less, we have Yll=i n i — ^ n (!d—i)' 

2 n ( n ~ 2 \ 

Thus we obtain 6(A) < (n-~i\ — d = — d, which implies the bound. □ 
2.3.5. Special classes of polytopes 

Where the conjectured upper bound cannot be proved, it is interesting to study upper 
bounds for special families of polytopes. Many of the polytopes appearing in combinatorial 
optimization belong to the class of network flow polytopes, which include transportation 
polytopes. In this section we mention that polynomial bounds are known for these and for 
some generalizations and related classes of polytopes. In addition, polytopes whose vertex 
coordinates are all zeroes and ones satisfy the Hirsch Conjecture. 

Integer vertices 

A polytope P is called a 0-1 polytope if every coordinate of every vertex of P is either or 
1. That is to say, P is the convex hull of some of the vertices of a cube. In [224] , Naddef 
proved that the Hirsch Conjecture holds for 0-1 polytopes. 

Theorem 2.3.23 (Naddef [224]). Let P be a d-polytope with n facets such that every vertex 
of P is in {0, l} d . Then diam(P) < n - d. 
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Proof. We assume that P is full- dimensional. This is no loss of generality since, if the 
dimension of P is strictly less than d, then P can be isomorphically projected to a face of 
the cube [0, l] d . 

Let u and v be two vertices of P. By symmetry, we may assume that u = (0, . . . , 0). If 
there is an i such that V{ = 0, then u and v are both on the face of the cube corresponding 
to {x 6 M d | Xi = 0}, and the statement follows by induction. Therefore, we assume that 
v = (1, . . . , 1). Now, pick any neighboring vertex v' of v. There is an i such that v\ = 0. 
Then, u and v' are vertices of a lower-dimensional 0-1 polytope and we have performed one 
pivot from v to v' . The result follows by induction on d. □ 



In [229J, Onn and Rothblum prove the following strengthening of Theorem 2.3.23 

Theorem 2.3.24 (Onn and Rothblum |229| ). Let P be a d-polytope with n facets such that 
every vertex of P is in {0, l} d . Then, under any linear functional £, there is a non-decreasing 
path from any vertex v to any vertex v* maximizing £, of length at most d using each 
edge-direction at most once. 



As a generalization of Theorem 2.3.23, in |185j Kleinschmidt and Onn prove the following 
bound on the diameter of lattice polytopes in [0, k] d . A polytope is called a lattice polytope 
if every coordinate of every vertex is integral. 

Theorem 2.3.25 (Kleinschmidt and Onn [185J ) . The diameter of a lattice polytope contained 
in [0, k] d cannot exceed kd. 

Using this theorem, one can obtain a bound for the diameter of all rational d-dimensional 
polytopes with n facets. However, the bound obtained is not polynomial in n and d, and it 
does not improve on the bound in |170j of Kalai and Kleitman (see [228] ). 



Transportation Polytopes 

Assuming the Hirsch Conjecture is true, every p x q transportation polytope has diameter 
at most p + q — 1. Brightwell, van den Heuvel, and Stougie (see |51] ) prove the diameter to 
be at most eight times that. This has been improved by Hurkens (see |162j) to: 

Theorem 2.3.26 (Hurkens [162]). The diameter of any p x q transportation polytope is at 
most 3(p + q — 1). 
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The dual transportation polyhedron given by the pxq matrix C = (Cj,) is the unbounded 
polyhedron D Ptq {C) = {(tii, • • . , u p , vi,..., v q ) G W p+q \ Ui + Vj < Cij for all i, j}. Duality 
here is in the sense of linear programming. In [18] . Balinski proved that the Hirsch Conjecture 
holds for the bounded polytopes resulting from the intersection of a dual transportation 
polyhedron D Ptq (C) with a certain hyperplane. 

Theorem 2.3.27 (Balinski |18j). Let C be an p x q matrix. The diameter of the polytope 
D Ptq (C)n{(u u ...,u p ,v 1 ,...,v q ) eR p+q K = 0} 

is at most (p — l)(q — 1). This bound is the best possible. 

In joint work with De Loera, Onn, and Santos (see |97J), we prove a quadratic bound (in 
p+q+s) on the diameter of 3-way axial pxqxs transportation polytopes. By the universality 



theorem of De Loera and Onn in |101j (see also Theorem 1.4.21 ), a generalization of this 
bound to faces of these polytopes would prove the polynomial Hirsch Conjecture. 

Theorem 2.3.28 (|97J). The diameter of every 3-way axial pxqxs transportation polytope 
is at most 2(p + q + s — 3) 2 . 

In Chapter |3j we examine the diameters of 3-way transportation polytopes, where we 



will present and prove Theorem 2.3.28 See Section 3.6 



Network flow polytopes 

Network flow polytopes are another generalization of transportation polytopes. Let G = 
(V,E) be a graph with v = \ V\ nodes and e = \E\ directed arcs. For each arc £ E, fix 
a capacity lower bound hj and upper bound Uij. The network flow polytope P determined 
by the directed graph G with capacity bounds {(hj,Uij) \ G E} and demand function 
c : V — > K is a polytope in the e variables ((i, j) G E) with the 2e inequalities 

hj — x ij — u ij-, f° r each G E 

and the v equations 

— x ji = C W) f° r each i G V. 

{jeV\(i,j)eE} {jeV\(j,i)eE} 
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Note that the p x q transportation polytope defined by vectors a G W and b E M q can be 
obtained as the network flow polytope associated to the complete bipartite graph G on p 
and q nodes with all edges directed in the same direction, by taking Uj = 0, Uij = oo and 

Cj = -en for i = 1, . . . ,p, c i+p = hi for i = 1, . . . , q. 

Every sufficiently generic set of parameters produces a simple (e — v + l)-dimensional 
polytope P with at most 2e facets. The results in [SO], |141| . and [230J prove the following 
upper bound on the diameter of network flow polytopes. 

Theorem 2.3.29. The diameter of the network Bow polytope on a directed graph G = (V,E) 
is O(evlogv). 

We note that evlogv is 0(n 2 logra), where n is the number of facets of P. 

A matrix is totally unimodular if all of its subdeterminants are 0, 1, or —1. Dyer and 
Frieze (see |117] ) gave a polynomial upper bound on the diameter of polyhedra whose 
defining matrix is totally unimodular, a case that includes all transportation and network 
flow polytopes. 

Theorem 2.3.30 ( |117| ) . Let A be a totally unimodular n x d matrix and let b G M n . The 
diameter of the polyhedron P = {x £ R d Ax < b} is in 0(d 1G n 3 (log(dn)) 3 ). 

In the statement, the polyhedron P can be assumed to be a (i-polytope with n facets. 

2.4. Negative results 

To contrast the previous section, we present counter-examples to three natural variants of 
the Hirsch Conjecture. 

2.4.1. The Unbounded and Monotone Hirsch Conjectures are false 

In the Hirsch Conjecture as we have stated it, we only consider bounded polytopes. However, 
in the context of linear programming the feasible region may well not be bounded, so the 
conjecture is equally relevant for unbounded polyhedra. In fact, that is how Hirsch originally 
posed the question (see page 168 of |88j). 
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Moreover, for the simplex method in linear programming one follows monotone paths: 
starting at an initial vertex x of the feasibility polyhedron P, one does pivot steps (that is, 
one moves along edges) always increasing the value of the linear functional £ to be maximized, 
until one arrives at a vertex x* where no pivot step gives a greater value to £. Convexity then 
implies that the vertex x* is the global maximum for £ in the feasible region. This raises the 
question whether a monotone version of the Hirsch Conjecture holds. Both variants of the 
Hirsch Conjecture fail. But before going any further, let us state both conjectures carefully: 

• The Unbounded Hirsch Conjecture: Is the diameter of every (bounded or not) 
d-polyhedron P with n facets at most n — d? 

• The Monotone Hirsch Conjecture: Is there, for every d-polytope P with n facets 
and every linear functional £, a ^-monotone path with at most n — d edges from 
any vertex v to a vertex v* where £ is maximized? Monotonically means that we 
require the value of £ to increase at every step. 

Here we show that the unbounded and monotone versions of the Hirsch Conjecture fail. 



Both proofs are based on the Hirsch-sharp polytope Q4 described in Section \2. 2. 2[ In fact, 
we want to emphasize that knowing the mere existence of such a polytope is enough. We 
are not going to use any property of Q4 other than the fact that it is Hirsch-sharp, simple, 
and has n > 2d. Simplicity is not a real restriction since it can always be obtained without 



decreasing the diameter (see Lemma 2.0.5). The inequality n > 2d, however, is essential as 



the constructions below would not work with, for example, a d-cube. 

Theorem 2.4.1 (Klee-Walkup [183J). There is a simple unbounded polyhedron Q4 with 
eight facets and dimension four and whose graph has diameter five. 

Proof. Let Q4 be the simple Klee-Walkup polytope with nine facets, and let u and v 
be vertices of Q4 at distance five from one another. By simplicity, the vertices u and v lie in 
(at most) eight facets in total and there is (at least) one facet F not containing u nor v. 

Let Q4 be the unbounded polyhedron obtained by a projective transformation that sends 
this ninth facet to infinity. The graph of Q4 contains both u and v, and is a subgraph of 



that of Q4, hence its diameter is still at least five. See Figure 2.13 for a schematic rendition 



of this idea. □ 
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7t 



Figure 2.13. Disproving the Unbounded Hirsch Conjecture 

Remark 2.4.2. It is interesting to observe that the "converse" of the above proof also works: 
from any non-Hirsch unbounded polyhedron Q with eight facets and dimension four, one 
can build a bounded polytope with nine facets and diameter still Eve, as follows: 

Let u and v be vertices of Q at distance five from one another. Construct the polytope 
Q by cutting Q with a hyperplane that leaves all the vertices of Q on the same side. This 
adds a new facet and changes the graph, by adding new vertices and edges on that facet. 
But u and v will still be at distance Eve: to go from u to v either we do not use the new 
facet F that we created (that is, we stay in the graph of Q4) or we use a pivot to enter the 
facet F and at least another four to enter the four facets containing v: since the Hirsch 
Conjecture holds for 3-dimensional polyhedra, u and v cannot lie in a common facet of Q. 

We now turn to the Monotone Hirsch Conjecture: 

Theorem 2.4.3 (Todd [274J). There is a simple bounded A-polytope P with eight facets, 
two vertices u and v of P, and a linear functional £ such that: 

(1) v is the only maximal vertex for £. 

(2) Any edge-path from u to v and monotone with respect to ^ has length at least five. 

Proof. Let Q4 be the Klee-Walkup polytope. Let F be the same "ninth facet" as in 
the previous proof, one that is not incident to the two vertices u and v that are at distance 
five from each other. Let H2 be the supporting hyperplane containing F and let Hi be 
any supporting hyperplane at the vertex v. Finally, let Hq be a hyperplane containing the 
(codimension two) intersection of H\ and H2 and which lies "slightly beyond Hi," as in 



Figure 2.14 (Of course, if Hi and H2 happen to be parallel, then Hq is taken to be parallel 
to them and close to Hi.) The exact condition we need on Hq is that it does not intersect 
Q4 and the small, wedge-shaped region between Hq and Hi does not contain the intersection 
of any 4-tuple of facet-defining hyperplanes of Q4. 
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H, 



1. 



H 




Figure 2.14. Disproving the Monotone Hirsch Conjecture 

We now make a projective transformation it that sends Hq to be the hyperplane at 
infinity. In the polytope Q' 4 = t^{Qa) we "remove" the facet F' = tt(F) that is not incident 
to the two vertices u' = tt(u) and v' = ir(v). That is, we consider the polytope Q'l obtained 
from Q' 4 by forgetting the inequality that creates the facet F' (see Figure 2.14 again). Then 



Q'l will have new vertices not present in Q' 4 , but it also has the following properties: 

(1) Q'l is bounded. Here we are using the fact that the wedge between Hq and Hi 
contains no intersection of facet-defining hyperplanes: this implies that no facet of 
Q'l can go "past infinity." 

(2) It has eight facets: four incident to u' and four incident to v' . 

(3) The functional £ that is maximized at v' and constant on its supporting hyperplane 
H[ = it (Hi) is also constant on H' 2 = ^(H?), and v! lies on the same side of H[ as 



In particular, no £- monotone path from u' to v' crosses H[, which means it is also a 



path from u' to v' in the polytope Q' 4 , combinatorially isomorphic to Q4. 



□ 



The diameter of Todd's polytope is four, so it does not give a counter-example to the 
Hirsch Conjecture. 

In both the constructions of Theorems 2.4.1 and |2.4.3| one can glue several copies of 
the initial block Q4 to one another. The basic idea is (the polar of) the same one used in 
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Corollary |2, 2. 11 We skip details, but in both cases we increase the number of facets by four 



and the diameter by five, per Q4 glued, obtaining: 

Theorem 2.4.4 (Klee-Walkup, Todd). The Unbounded and Monotone Hirsch Conjectures 
are false: 

(1) There are unbounded A-polyhedra with 4 + Ak facets and diameter 5k, for every 
k>l. 

(2) There are bounded A-polyhedra with 5 + Ak facets and vertices u and v of them 
with the property that any monotone path from u to v with respect to a certain 
linear functional £ maximized at v has length at least 5k. 

This leaves us with the following questions: 

Open Problem 2.4.5. Improve these constructions so as to get the ratio of "diameter versus 
facets" bigger than 5/4. Can the ratio be as big as two? That is: Is there a d-dimensional 
polytope P with n facets having the property that, for every linear functional £ on P, 
there is a ^-monotone path with at least 2(n — d) from any vertex v to a vertex v* where 
£ is maximized? Is there an unbounded d-dimensional polyhedron P with n facets whose 
diameter is 2(n — d) or greater? 

A ratio bigger than two for the Unbounded Hirsch Conjecture would probably yield 
counter-examples to the bounded Hirsch Conjecture. 

Ziegler (see page 87 of |290| ) poses the following conjecture stronger than the Monotone 
Hirsch Conjecture: 

Conjecture 2.4.6 {Strict Monotone Hirsch Conjecture). For every linear functional £ on a 
d-polytope with n facets there is a ^-monotone path of length at most n — d. 

Put differently, in the Monotone Hirsch Conjecture we add the requirement that not 
only v but also v* has a supporting hyperplane where £ is constant. 

Finally, even though the Unbounded and Monotone Hirsch Conjectures are false in 
general, it would be interesting to know if there are any non-trivial families of polytopes (or 
spheres) where it is true: 

Open Problem 2.4.7. Are there any non-trivial (infinite) families of polyhedra for which 
the Unbounded Hirsch Conjecture holds? 
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Open Problem 2.4.8. Are there any non-trivial (infinite) families of polyhedra for which 
the Monotone Hirsch Conjecture holds? 

2.4.2. The Topological Hirsch Conjecture is false 

We discuss a topological generalization of the Hirsch Conjecture. Since (the boundary of) 
every simplicial (i-polytope is a topological triangulation of the (d — l)-dimensional sphere, 
we can ask whether the simplicial version of the Hirsch Conjecture, the one where we walk 
from simplex to simplex rather than from vertex to vertex, holds for arbitrary triangulations 
of spheres. As in the previous section, to be precise: 

• The Topological Hirsch Conjecture: IfT is topological triangulation of the (d — 1)- 
sphere with n vertices and G A (T) is its polar graph, is diam(G A (T)) at most 
n — d? 

Here, diam(G A (T)) denotes the diameter of the polar graph, as in the case of simplicial 
polytopes. This generalizes the Hirsch Conjecture since the proper faces of a simplicial 
(i-polytope form a triangulation of the (d — l)-sphere. But the converse is not true: starting 
in d = 3 and with 8 vertices there are non-polytopal triangulations of (i-spheres, that are not 
combinatorially isomorphic to the boundary of any polytope. 

The first counter-example to this statement was a simplicial non-Hirsch 27-sphere Z with 
56 vertices found by Walkup in 1979 (see [285] ), but simpler counter-examples were soon 
constructed by Walkup and Mani in [208]. (The analogous problem for polyhedral maps on 
general surfaces is studied in |237j .) Both constructions are based on the equivalence of 



the Hirsch Conjecture to the Non-revisiting Conjecture (see Theorem 2.3.6). The proof of 
the equivalence is purely combinatorial, so it holds true for topological spheres. Mani and 
Walkup's counter-examples are 3-spheres C and D (respectively) with 20 and 16 vertices 
(respectively). We only describe D here, which is obtained from C by contracting four 
edges and deleting degenerate tetrahedra. Wedging on D eight times produces a non-Hirsch 
11-sphere E with 24 vertices. 

Theorem 2.4.9 (Mani- Walkup [208]). There is a triangulated 3-sphere D with 16 vertices 
and without the non-revisiting property. Wedging on it eight times produces a non-Hirsch 
11-sphere E with 24 vertices. The dual diameter of E is more than 12. 
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The part of the Mani-Walkup 3-sphere D that implies failure of the non-revisiting 
property involves only 12 of the 16 vertices. More precisely, Mani and Walkup show the 
following: 

Lemma 2.4.10. Let K be the three-dimensional simplicial complex on the vertices a, b, c, d, 
m, n, o, p, q, r, s, and t consisting of the following 26 tetrahedra: 

abed mnop 
abcr acdr mnoq mopq 

abdt bedt nops mnps 

abmr benr anoq bopq 

cdor dapr cpmq dmrq 

abmt bent anos bops 

cdot dapt cpms dmrs 

Then: 

(1) The complex K can be embedded in a 3-sphere. 

(2) No triangulation of the 3-sphere containing K as a subcomplex has the non-revisiting 
property. 

(3) There is a triangulation of the 3-sphere with 16 vertices and containing K as a 
subcomplex. 

We are not going to prove part 3 of the lemma. The construction is somehow complicated 
and, moreover, in a sense that part is irrelevant. Indeed, once we know that the complex K 
can be embedded in a 3-sphere we can rely on Whitehead's Completion Lemma (sec [287J) 
to conclude that the complex K can be completed to a triangulation of the whole 3-sphere. 
The only drawback of this approach is that we cannot control a priori the number of extra 
vertices needed in the completion, but that will only affect the number of vertices of the 
final 3-sphere (and the number of wedges needed to get a non-Hirsch sphere from it). 

Proof of parts 1 and 2 of Lemma [2.4.101 The proof follows from the following 
description of the simplicial complex K: it consists of two triangulations of bipyramids over 
the octagons ambneodp and aobpcmdn, glued along the eight vertices of the octagons. See 
Figure [2T5. 
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Figure 2.15. Two octagonal bipyramids in the triangulation of S depicting 
the outside-most 16 tetrahedra 



Once this is shown, part 1 is easy. Since we are living in a topological world, we can 
"pinch" the equatorial vertices of one of the octagons and there is no obstruction to glue 
them to their counterparts in the other octagon. One key property is that we are not gluing 
any of the edges: the order of vertices in the two octagons is not the same, and it is designed 
so that no edge appears in both. 

For part 2, let us see the construction in more detail. It starts with a core tetrahedron 



inside each bipyramid, namely abed and mnop. See Figure 2.16 




Figure 2.16. The simplices abed and mnop 
Each of these tetrahedra is surrounded by two tetrahedra joined to each apex of the 



corresponding bipyramid, as shown in Figure 2.17 These are the tetrahedra in the second 



and third line of the statement and together with the initial ones they triangulate two 
octahedra. Finally, these octahedra are each surrounded by eight more tetrahedra each: 
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those obtained joining the four triangles left uncovered in each octagon (see Figure 2.17 
again) to the two apices of their bipyramid. 




Figure 2.17. Eight additional simplices in the Mani-Walkup triangulation 



From this description it is easy to prove part 2 of the lemma, as follows: Every path 
from the tetrahedron abed to the tetrahedron mnop must leave the bipyramid on the left of 



Figure 2.15 and it will do so through one of the sixteen boundary triangles. These triangles 
are the joins of the eight edges of the octagon to the two apices. In particular, our path will 
at this point have abandoned three of the vertices of abed and entered one of mnop. For 
the non-revisiting property to hold, the abandoned ones should not be used again, and the 
entered one should not be abandoned, since it is a vertex of our target tetrahedron. But 
then it is impossible for us to enter the second bipyramid: we should do so via another 
triangle that joins an octagon edge to an apex, and non-revisiting implies that this edge 
should use the same vertex form abed and the same vertex from mnop. This is impossible 
since the two octagons have no edge in common. 

Let us explain this in a concrete example. By symmetry, there is no loss of generality 
in assuming that we exit from the left bipyramid via the triangle amr. Since we cannot 
abandon m, we must enter the second bipyramid via one of the boundary triangles using m, 
namely one of mcs, mcq, mds or mdq. This violates the non-revisiting property, since c and 
d had already been abandoned. □ 

Unfortunately, this triangulated 3-sphere does not give a counter-example to the Hirsch 
Conjecture. It would give a counter-example if it were polytopal (that is, if it were combi- 
natorially isomorphic to the boundary complex of a four-dimensional polytope). However, 
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Altshuler (see [7j) has shown that (for the explicit completion of the subcomplex K given 
in [208J) this is not the case. We summarize the proof here. 

Theorem 2.4.11 (Altshuler [7]). The topological triangulated spheres C, D, and E are not 
polytopal. 

Proof. Suppose for a contradiction that the 3-sphere C with 20 vertices had a polytopal 
realization P. Define Q to be the convex hull of all vertices of P except for the vertex t 
(defined in |208j). The polytope Q has a triangulation using no additional vertices, so the 
star stare (t) of t can be replaced by a simplicial complex C which only uses vertices of C 
and whose boundary complex coincides with that of stare (t). Altshuler proves that the 
triangle abd is in the link lkc(t) of t. Then, one of abdc, abdi, abdj, abdk, abdm, abdn, abdo, 
abdp, or abds must be a tetrahedron in C . But, since the edges ac, di, dj, aj, ak, bl, dm, 
an, ao, bp, and as appear in the anti-star astc(i), none of them can appear in the complex 
C , which means that none of the ten tetrahedra could exist. Contradiction. 

The same proof works on the 3-sphere D with 16 vertices. The polyhedron Q is again 
defined by taking the convex hull without the vertex t. The 11-sphere E is not polytopal 
since it is built up from D by repeated applications of the one-point suspension. □ 

It remains an open question if any completion of K to the 3-sphere is polytopal. 

Open Problem 2.4.12. Find a completion of the complex K to a 3-spnere that is polytopal. 

Even more strongly, it is probably the case that K cannot be embedded in M 3 with linear 
tetrahedra, a necessary condition for polytopality by the Schlegel construction (see [290J). 

Open Problem 2.4.13. Embed the complex K in M 3 using linear tetrahedra. 

As in the monotone and bounded cases, several copies of the construction can be glued 
to one another. Doing so provides triangulations of the 11-sphere with 12 + 12 k vertices and 
diameter at least 13/c, for any k. It remains to know how far this false conjecture is from 
being true. Examples whose diameter is 2(n — d) or more would be very significant. Current 
ones achieve y| (n — d) . 

Open Problem 2.4.14. Is there a triangulation of a (d — 1) -dimensional sphere using n 
vertices whose dual graph has diameter 2{n — d) or greater? 
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Even though the Topological Hirsch Conjecture is false in general, it would be interesting 
to know if there are any non-trivial families of spheres where it is true. 

Open Problem 2.4.15. Are there any non-trivial families of triangulated spheres for which 
the Topological Hirsch Conjecture holds? 
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CHAPTER 3 



Geometric Combinatorics of Transportation Polytopes 

This chapter discusses new results on the combinatorial properties of 2-way and 3- 



way transportation polytopes. In Section 1.5 we discussed our process of classifying all 
transportation polytopes of a certain size. Based on the data we collected (see Appendix |A|) , 
we discovered and proved the following results: 

Theorem 3.0.1. The number of vertices of a non-degenerate p x q classical transportation 
polytope is divisible by the greatest common divisor gcd(p, q) of p and q. 

Theorem 3.0.2. The p x 2 x 2 planar transportation polytopes are in 1-1 correspondence 
with the p x 2 classical transportation polytopes, with corresponding pairs being linearly 
isomorphic. 



Theorem 3.0.1 is presented in Section 3.1 and Theorem 3.0.2 is proved in Section 3.2 



Note that Theorem 3.0.2 is best possible in the sense that for p, q > 3 there are many 
more types of planar p x q x 2 transportation polytopes than types of p x q transportation 
polytopes. (See the complete list of 3 x 3 classical and 2x3x3 planar transportation 



polytopes in Tables A. 4 and A. 9 respectively.) 

In Section |3.3[ we discuss the existence of non-regular triangulations for the vertices 
of Birkhoff polytopes B n , for n > 4. We prove that the polytope B4 has non- regular 
triangulations. Together with previously-known results, this proves that the polytope B n 
only has regular triangulations if and only if n < 3. 

We present new bounds for the diameters of transportation polytopes and subpolytopes 



of them. In Section 3.4 we construct infinitely-many Hirsch-sharp transportation polytopes. 



In Section 3.5, we prove the Hirsch Conjecture for p x 2 classical transportation polytopes. 



In Section 3.6 we prove the following diameter bound for 3-way transportation polytopes 
defined by 1-marginals: 
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Theorem 3.0.3. The graph of every 3-way transportation polytope of size p x q x s defined 
by 1-marginals has diameter at most 2(p + q + s — 2) 2 . 

This bound was first given in |97j . proved in collaboration with De Loera, Onn, and 



Santos (see Theorem 2.3.28). If the Hirsch Conjecture (Conjecture 2.0.1) is true, then the 
diameters of 3-way transportation polytopes of size p x q x s defined by 1-marginals are at 
most p + q + s — 2. 

Lemma 3.0.4. Assume the Hirsch Conjecture. Then the diameter of every 3-way transporta- 
tion polytope of size p x q x s defined by 1-marginals is at most p + q + s — 2. 

Proof. Let P be a 3-way px qx s axial transportation polytope defined by 1-marginals. 
The dimension of P is pqs — (p + q + s — 2). Since facets can only be obtained by the 
inequalities of the form scjj * > 0, the number n of facets of P is bounded above by pqs. By 
the Hirsch Conjecture is true, the diameter of P is no more than n — d, which is no more 
than pqs — (pqs — (p + q + s — 2)). □ 

Remark 3.0.5. Note that the "converse" does not prove the Hirsch Conjecture is true for 
3-way axial transportation polytopes defined by 1-marginals. That is to say, a result that 
the diameter of these polytopes is no more than p + q + s — 2 does not prove the Hirsch 
Conjecture is true if the p x q x s polytope has strictly less than pqs facets. 

A similar result for the graph of a p x q classical transportation polytope was given by 
Brightwell et al. (see |51] ). who proved an upper bound of 8{p + q — 2) for the diameter. 
More recently, Hurkens (see |162j ) has obtained a bound of 3(p + q — 1), a factor of three 
away from the predicted value of the Hirsch Conjecture. 

As we observed, [51J provided the first linear bound for the diameter of the graphs 



of 2-way transportation polytopes. Theorem 3.0.3 provides a quadratic bound for 3-way 
transportation polytopes defined by 1-marginals and, moreover, a sublinear one if we assume 
that the three parameters p, q, and s are approximately the same. (Observe that the number 
of facets of a 3-way transportation polytope is bounded above by the product pqs of its size 
parameters) . 

Bounding the diameter of 3-way transportation polytopes is particularly interesting 
because of the following results proved by De Loera and Onn in |103j : 
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(1) Any rational convex polytope can be rewritten as a face F of a 3- way transportation 
polytope of size p x q x s denned by 1-marginals. The sizes p,q,s, the 1-marginals 
u, v, w, and the entries that are prescribed to be zero in the face F can be 
computed in polynomial time on the size of the input. 

(2) More dramatically, any rational polytope is isomorphically representable as a planar 
3-way transportation polytope. 



That is to say, a version of Theorem 3.0.3 for the 3-way transportation polytopes by 



2-marginals, or a version for 3-way transportation polytopes defined by 1-marginals that 
allows one to prescribe some variables to be zero, would provide a polynomial upper bound 
on the diameter of the graph of every convex rational polytope. Another consequence of 



these results is that the method of Section 1.5 for enumerating all combinatorial types of 



planar 3-way transportation polytopes, yields, in particular, an enumeration of all types of 
rational convex polytopes. 



We close with Section 3.7, which proves new diameter bounds for network flow polytopes, 
which are subpolytopes of transportation polytopes. 

Let us finally mention that our systematic listing of non-degenerate transportation 
polytopes of small sizes provides the solution to at least four open problems and conjectures 
about transportation polytopes stated in the monograph [288J: 

(1) In [T84|, Klee and Witzgall prove that the largest possible number of vertices 
in classical transportation polytopes of size p x q is achieved by the generalized 



Birkhoff 2- way polytope (see Definition 1.4.17 on page 34). Problem 32 on page 400 



of [288J conjectured that the same holds in general. 



But in Example 3.0.6 we provide an explicit counter-example of this for planar 



3-way transportation polytopes. (See the definition of the generalized Birkhoff 



3-way planar polytope in Definition 1.4.20 on page |36|) 



(2) Question 36 on page 396 of |288j asked: Is it true that every integer of the form 
(p — l)(q— l)(s — 1) +t, where 1 < t <pq-\-ps + qs—p — q — s, and only these integers, 
can equal the number of facets of a non- degenerate planar 3-way transportation 
polytope of order p x q x s defined by 2-marginals, where p, q, s > 2 ? 

For the case p = q = 2 and s = 3, the conjecture asks if every integer from 3 to 
11, and only these integers, equal the number of facets of non-degenerate 2x2x3 
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planar transportation polytopes denned by 2-marginals. Table A. 8 on page |137| 
answers the question negatively: the number n of facets can be 3, 4, 5, or 6 (and 
these do indeed occur), but 7 through 11 are in fact missing. 

(3) Similarly, Conjecture 33 on page 400 of [288] asked: Is it true that every integer 
from 1 to pq + ps + qs — p — q — s + 1 , and only these numbers, are realized as the 
diameter of a planar 3-way transportation polytope defined by 2-marginals of order 
p x q x s? 

The same case p = q = 2, and s = 3 shows that this is false. The transportation 
polytopes obtained are polygons with up to six sides, hence of diameter at most 
three, instead of 10. 

(4) Open problem 37 on page 396 of [288J asks whether the number Jq of vertices of 
every p x q x s non- degenerate 3-way planar transportation polytope defined by 
2-marginals satisfies (p — l)(q — l)(s — 1) + 1 < /q < 2(p — l)(q — l)(s — 1). 

We show the answer is no even in the case of non-degenerate 2x2x4 planar 
transportation polytopes. 



In addition to the four solved problems above, Theorems 1.5.2 and 3.0.1 are initial steps 



on the solution of Problem 25 in page 399 of [288]. It asks to find the complete distribution 
of possible number of vertices for transportation polytopes. 

Example 3.0.6. Here is an application of this method, which gives a counter-example to 
open problem 37 of |288j. The 2-marginals U, V , and W below dehne a 3 x 3 x 3 planar 
transportation polytope which has more vertices (270 vertices ) than the generalized Birkhoff 
3-way planar polytope, with only 66 vertices: 



U 



164424 324745 127239 
262784 601074 9369116 
149654 7618489 1736281 



V 



163445 49395 403568 
1151824 767866 8313284 
1609500 6331023 1563901 
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W 



184032 123585 269245 
886393 6722333 935582 
1854344 302366 9075926 



Based on the data collected from the enumeration process, we conjecture the following 
to be true: 

Conjecture 3.0.7. The graph of every non-degenerate p x q transportation polytope has a 
Hamiltonian cycle if pq > 4. 

Conjecture 3.0.8. If P is a non-degenerate 3-way transportation polytope of size px q x s 
(p, q, s > 3) defined by 1-marginals, then the diameter of its graph G(P) is equal to n — d, 
where d = pqs — p — q — s + 2 is the dimension and n < pqs is the number of facets of P. 

Among p x q classical transportation polytopes (p, q < 5), there are non-degenerate 
(i-polytopes where the diameter of the graph G(P) is strictly less than n — d, where n < pq 
is the number of facets. (See |175j .) 

3.1. The number of vertices of p x q classical transportation 
polytopes 



In this section, we prove Theorem 3.0.1, which says that the number of vertices of a non- 
degenerate p x q classical transportation polytope is divisible by gcd(p, q). Recall that A^ 
denotes the cf-dimensional simplex, which has d+ 1 vertices. For ease of notation, we will also 
denote the ci-simplex with d+l vertices by D^+i- In particular, by this shift of subscript, we 
have .Drf+i = for all d > 0. The first observation, already hinted in Example |1.5. 4 is that 



the vector configuration A p ^ q associated to these transportation polytopes is (a cone over) 
the set of vertices of the product D pq = D p x D q of two simplices D p and D q of dimensions 
p—1 and q — 1, respectively. So, we are interested in the cardinalities of chambers in the 
product of two simplices. Here and in what follows we call the cardinality of a chamber c of 
A p ^q the number of bases of A Ptq that contain the chamber c. We denote it by \c\. The proof 



of Theorem 3.0.1 consists of the following two steps, which are established respectively in 



the two lemmas below: 
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b+ = {2.3,5} 6 C+ 
6 n = {3,5}, i>+ = 2 



1 




^5 



i 



Figure 3.1. A cross-section of the chamber complex of some cone with two 
adjacent chambers 

• There is a "seed" chamber in D Ptq whose cardinality is indeed a multiple of gcd(p, q). 

• The difference in the cardinalities of any two adjacent chambers of D p x D q is a 
multiple of gcd(p, q). 

Since the chamber complex is a connected polyhedral complex (where two adjacent chambers 
are divided by a hyperplane supported on the vector configuration) the two lemmas settle 
the proof. 

Let us define the lexicographic chamber of D p ^ q recursively as the (unique) chamber 
incident to the lexicographic chamber of D pq -i- The recursion starts with which 
is an (p — l)-simplex and contains a unique chamber. Observe that the definition of the 
lexicographic chamber is not symmetric in p and q. For example, the lexicographic chamber 
of the triangular prism D^^ is the one incident to a basis of the prism, and has cardinality 
3. The lexicographic chamber of Z?2,3 is incident to one of the edges parallel to the axis of 
the prism, and has cardinality four. 

Lemma 3.1.1. The lexicographic chamber of D P)q is contained in exactly pi -1 simplices. 

Proof. The cardinality of the lexicographic chamber of D Ptq equals the cardinality of 
the lexicographic chamber of _D Pj(? _i times the number of vertices of D P)C{ not lying in its 
facet D Pj(? _i. The latter equals p. □ 

When moving from a chamber c_ to an adjacent one c+ we "cross" a certain hyperplane 
rl spanned by all except one of the elements of any basis containing c+ but not containing 
c_ . Let us denote by C+ and C_ the subsets of A p>q lying in the sides of fi containing c + 
and c_ respectively. (Remember that, in our case, A P:q equals the set of vertices of D Ptq .) 
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Observe also that the common boundary cq C T~L of c+ and c_ is a chamber in the vector 
configuration A pA n H. 

Lemma 3.1.2. Let A p ^ q is the set of vertices of D PA . Then, 

(1) | c+ |-|c-| = M(|c+|-|cq). 

(2) \C + \ — |C_| is a multiple of gcd(p, q). 

Proof. A basis 6+ contains c+ but not c_ if and only if 6 + is of the form bo U 
where bo is a basis of BOH containing Co and v+ is an element of C+. This and the analogous 
property for c_ proves the first part. 

For the second part, we restate a few facts in the terminology of oriented matroids. This 
makes the proof easier to write. (For, details see [39 or |42j.) 

• In oriented matroid terminology a pair (C+, C_) consisting of the subconfigurations 
on one and the other side of a hyperplane % spanned by a subset of A p ^ q is called a 
cocircuit of A p ^ q . That is, part 2 is a statement about the cocircuits in the oriented 
matroid M Pt q associated to the vertices of the product of two simplices. 

• The oriented matroid M p>q coincides with the one associated to the complete 
directed bipartite graph K p ^ q (i.e., the complete bipartite graph with all of its 
edges oriented from one part to the other). Thus, part 2 is a statement about the 
cocircuits in the oriented matroid of the directed bipartite graph K pq . 

• The cocircuits of a directed graph G = (V, E) are all read off from cuts in the graph. 
By this we mean that the vertex set V is decomposed into two parts (V+, V_). The 
cocircuit (C+, C_) associated to the cut (V+, VL) has C + consisting of all the edges 
directed from to V— and C_ consisting of all the edges directed from V- to V+. 

Using the dictionary between the directed graph K pq and the product of simplices we 
can finish the proof. Let (V+,V-) be a cut in the complete directed bipartite graph K p ^ q . 
Since our graph is bipartite, we have V + and V- naturally decomposed as U and 
u respectively. The sizes of C + and C_ are then: 

|C7 + | = |K? ) |-|^ ,) | and \C\ = \v[ p) \-\vi q) \. 
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Now, using that (V^l + \ V^ p) \ = p and + \V^ q) \ = q we get: 

\c+\ - |c_| = \vi p) \ • (q - \vi q) \) - \vi q) \ • (p - |f) p) |) = |y) p) | • q - \vi q) \- P , 

which is clearly a multiple of gcd(p, q). □ 

3.2. Identical classes of transportation polytopes 



Here, we present the proof of Theorem 3.0.2 which says the p x 2 x 2 planar transportation 
polytopes are linearly isomorphic to the p x 2 classical transportation polytopes. (The 
theorem is obtained as joint work with De Loera, Onn, and Santos.) First, we prove the 
following lemma: 

Lemma 3.2.1. The planar 2 x p x q transportation polytopes are exactly the p x q trans- 
portation polytopes with bounded entries. 



Proof. Every planar 2 x p x q transportation polytope 

■ ^ y x i,j,k — ^i,i> ^ \ — ^l,i:> ^l,j,k ~i~ %2,j,k — ^^j,k 



P 



k j 

is linearly isomorphic to a p x q transportation polytope with bounded entries, 



Q = { ( x l,j,k) G M>g px<? : y^ai^fc = C/ij, ^xij,fc = Pi,*,, xij,*, < W j)k } ■ 

k j 

via the projection R 2x p Xi ? — y W x i taking (xij k) i— )■ (xij fc), which maps P bijectively onto Q. 
Conversely, every pxg transportation polytope Q with bounded entries is linearly isomorphic 
to a planar 2 x p x q transportation polytope P by defining U^j ■= (Ylk Wj,k) — Uij for 



j = 1, . . . ,p and V 2 ,fc := (Ylj W j,k) ~ Vi,k for k = 1, . . . , q 



□ 



Proof of Theorem I3.Q.2L Consider any planar 2 x 2 x p transportation polytope 



P = (*y,») G 4o 2XP : 



y] x i,j,k = Uij, Xi t i } k + a?i,2,fc = V^fc, Xi,j,fc + X2j,fc = Wj,fe > 

fc J 
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defined by 1-marginals. The equations of the last two types imply that for each k we can 
express all the Xi j k in terms of Xx t \ k as follows: 

%l,2,k = Vl,k — %l,l,k, 
X2,\,k = W ljk -X 1>l7 k, 

X2,2,k = xi,i,k + W 2 ,k ~ Vi t k = a?i,i,fe + V2,k - W\,k- 
In particular, P is linearly isomorphic to its projection 



Q = (zi.i,*) G Rlf xp ■ 



k ) 



where a k = max{0,Wi >A - V 2jk } = max{0,V 1>k - W 2y k] and j3 k = xnhL{V 1>k ,W 1>k }. Now, 
by applying a translation to Q, there is no loss of generality in assuming that a k = for 
all k. Then Q is a 1-way transportation polytope with bounded entries, isomorphic (by 



Lemma 3.2.1) to a p x 2 classical transportation polytope. 



Conversely, any p x 2 classical transportation polytope of the form 



I k 



X j,k = Uj, X ljk + X 2 ,k = Vk 



defined by marginals u = (u\,u 2 ) G M> and v = (v±, . . . , v k , . . . ,v p ) G M> is linearly 
isomorphic to the following planar 2 x 2 x p transportation polytope: 



P 



, \ ^_ Tn)2x2xp J2k x l,j,k — J2k x 2,3-j,k — Uj, 

{Xi,j,k) e M >0 y ■ 



> . 



Si X iA,k = Si X i,2,fc = Sj x l J,fe = Sj X ^,j,k = V k 

The equations relating the solutions of Q to those of P are Xj ik = xij ;k = x 2 ^-j, k - □ 



The above result is the best possible since the list of 3 x 3 classical transportation 



problems presented in Table A. 4 on page 136 is not the same as the list of 2 x 3 x 3 planar 



transportation polytopes presented in Table A. 9 on page 137 



3.3. Regular triangulations and Birkhoff polytopes 



The computation of triangulations of the pth Birkhoff polytope is related to the problem of 
generating apxp doubly stochastic matrix uniformly at random (see, e.g., |65| . or }63j where 
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Cappellini et al. use a probability measure on B p to estimate the volume of the Birkhoff 
polytope). In this section, we study the existence of non-regular triangulations for Birkhoff 
polytopes. (Recall that the pth. Birkhoff polytope B p , introduced in Definition 



1.4.14 



on 



page 32, is the convex hull of the p\ permutation matrices of size p x p.) In 1996, De Loera 
(sec [90 ) proved that the product A3 x A3 of two tetrahedra has non-regular triangulations. 
Using this, one can show that the Birkhoff polytope B p has non-regular triangulations 
whenever p > 8. De Loera, Rambau, and Santos (see |105j ) recently used the results in [90J 
to show that the Birkhoff polytopes Bq and B7 have non-regular triangulations. 

Here, we prove that non-regular triangulations of the Birkhoff polytope B4 exist. The 
polytope B2 is a line segment and the polytope B3 is a 4-polytope with 6 vertices, which 
implies that it has only two triangulations, both of which are regular (see, e.g., Section 
2.4 of [105J). Thus, by computing ranks, we know that the polytope B p only has regular 
triangulations when p < 3. Combined with the results in this section, this tells us that the 
Birkhoff polytope B p has non-regular triangulations if and only if p > 4. 

We first define triangulations, following the book |105j . Since we only consider distinct 
points in convex position (i.e., no point is the convex combination of the remaining points), 
we will not need to give the definitions in their most technical forms here. 

Definition 3.3.1. Let A be a finite collection of distinct points in IR C in convex position. 
Let P = conv(^4) and denote by d the dimension of the polytope P. Then, a collection T of 
simplices o\ , . . . , erg is a triangulation of A if: 



(1) The collection T is a simplicial complex: that is, if the simplex a is in T and r is a 
face of a, then r is in T '■ 

(2) The vertex set of each d-simplex a G T is a subset of A. 

(3) The union of all simplices in T is P , namely: Uj=i a i = conv(A). 

(4) For any two simplices ai and Oj in T , their intersection r = Oj belongs to T 
and is a face of both a, and Uj 

In this section, we study triangulations that are regular. Let w : A — > M be a function 
that assigns a real number w(a) to each point a in A. The function w is called a lifting. The 
lower envelope of the lifting w of A are the facets of the polytope conv{(a, w(a)) \ a G A} 
that are visible from below. (The facets of the polytope conv{(o, w(a)) \ a G A} that are 
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"visible from below" are those facets whose outward normal vector £ = (£1, . . . , £c+i) has the 
property that the last coordinate £ c+ i < 0. 

Definition 3.3.2. Let A be a finite collection of distinct points in M c in convex position. A 
triangulation T of A is called regular if it can be obtained by projecting the lower envelope 
of a lifting w of A to M c+1 . 

Theorem 3.3.3. The fourth Birkhoff polytope B4 has a non-regular triangulation. 

To prove this theorem, we consider the face F of the Birkhoff polytope B4 given by the 
additional equations x\\ = £4,4 = 0. The 14 vertices of this 7-dimensional face F of B^ are 



given in Figure 3.2 



X a 



10 
1 
10 
10 

10 
10 
1 
10 

10 
10 
1 
10 

1 
10 
10 
10 



X r , 



x b 



x P 



x h 



Xu 



1 
10 
10 
10 



10 
10 
1 
10 

10 
1 
10 
10 

10 
10 
1 
10 

1 
10 
10 
10 



Xn 



X r , 



Xf 



X; 



x, 



1 
10 
10 
10 



10 
1 
10 
10 

10 
1 
10 
10 




1 



10 
10 

1 
10 
10 
10 



Figure 3.2. The 14 vertices of the face F. 



After homogenization (see, e.g., [105] or 12901 ). we obtain A, the vector configuration 
given by the columns of the matrix shown in Figure |3.3| 

Let T be the triangulation of the vector configuration A (respectively, point configuration 
{X a , . . . ,X n }) whose highest- dimensional cones (respectively, simplices) are displayed in 



Figure 3.4 In Figure |3^4| each 7-dimensional simplex (or cone over the simplex, for the 
vector configuration) is defined by the indices of its vertices. For instance, the 7-dimensional 
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Figure 3.3. The vector configuration A, as the columns of a matrix. 



o-l : 


abcdefgi, 


02 : 


abdefghi, 


03 : 


acdefgik, 


0-4 : 


adefghik, 


05 : 


aefghikl, 


06 : 


adeghikl, 


07 : 


afghijkl, 


08 : 


acdefikm, 


cr 9 : 


adefhikm, 


010 


: acdfijkm, 


0n 


: adfhijkm, 


012 


: aefhiklm, 


CT13 


: deghiklm, 


014 


: adehiklm, 


015 


: acdfjkmn, 


016 


: adfhjkmn, 


017 


: aefhilmn, 


018 


: afhiklmn, 


019 


: adhiklmn, 


020 


: afhijkmn, 


021 


: adhijkmn, 


022 


: afhijkln, 


023 


: abcdfijk, 


024 


: abcdfgik, 


025 


■ abfghijk, 


026 


: abghijkl, 


027 


: abdfhijk, 


028 


: abdfghik, 


029 


: abdghikl, 


030 


: abhijkln, 


031 


: abdhikln, 


032 


: abdhijkn. 



Figure 3.4. The highest-dimensional cones o\, . . . , 032 in the triangulation T 
simplex o\ is the convex hull of the eight points X a , Xb, X c , X4, X e , Xf, X g , and JQ. In 



Appendix [Bj we prove that T is a triangulation of A. (See page 138 ) 

We will prove that the triangulation T of A is non-regular by considering a Gale transform 

of A. A Gale transform B of A is given by the columns of the matrix shown in Figure |3.5| 
We use the same indices for the vectors in B that we used for A. For example, the 



first column in (3.2) defines the vector Y a = (0,0,0,-1,0,-1) G Mr, the second column 
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(3.2) 
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-1 1 



1 



-1 




1 



-1 





1 
1 
1 
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1 1 



D 1 
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-1 

-1 







-1 





1 
10 10 
10 10 
1 1 

1 



-1 



Figure 3.5. The Gale transform B. 

defines the vector YJ, = (0,0,1,1,1,1) € M 6 , the third column defines the vector Y c = 
(1, 0, —1, 0, 0, 1) S M 6 , and so on. We define 32 full-dimensional simple cones C±, . . . , C32 in 
M 6 . Each cone is defined by "complementation of indices" of the corresponding simplex. 
For example since a~i is the convex hull the X z with z in {a, b, c, d, e, /, g, i}, the cone C\ is 
generated by the vectors Y z , with z in {a, b, . . . , n} \ {a, b, c, d, e, f, g, i} = {h,j, k, I, m, n}. 
(That is to say, the cone C\ is just the positive orthant {y E M 6 | yi, . . . , y% > 0}.) Similarly, 
since the simplex 02 is given by the indices abdefghi, the cone C2 is the set of all positive 
linear combinations of Y c , Yj, Y^, Y[, Y m , and Y n . 

It is known (see, e.g., Chapter 5 of [TU5] . or [273], and part of Lemma [L5T5 



on page 51 ) that the triangulation T of A is regular if and only if the intersection 

32 



D = p| relint(q 



i=l 

is non-empty. It is easy to check the following facts: 

• All points y in the cone C\ satisfy the inequality 2/3 > 0. 

• All points y in the cone C%q satisfy the inequality — y2 — ys + 2/6 > 0. 

• All points y in the cone C13 satisfy the inequality —2/4 + 2/5 > 0. 

• All points y in the cone C32 satisfy the inequality 2/2 — 2/3 + 2/4 — 2/6 > 0. 

In fact, each of the four inequalities above is a facet-defining inequality of their respective 
cone. Now, if there is a point y G M 6 that belongs to -D, then y must satsify: 

2/3 > 0, 
-2/2 - 2/5 + 2/6 > 0, 
-2/4 + 2/5 > 0, 
2/2 - 2/3 + 2/4 - 2/6 > 0. 
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But the sum of the left-hand sides is the same as the sum of the right-hand sides, so this 
is an inconsistent system. In particular, D is empty, which proves the triangulation T is 
non-regular. (In fact, with further analysis one can show that the intersection of all 32 cones 
is the single ray {y E M 6 | A(0, 0, 0, 0, 0, 1), A > 0}.) We can complete this triangulation T of 
A to a triangulation of by pulling or placing (see [105]). Therefore the fourth Birkhoff 
polytope -B4 has non-regular triangulations. As an immediate corollary, we get: 

Theorem 3.3.4. The Birkhoff polytope B p has non-regular triangulations for p > 4, and all 
triangulations of B p are regular for p < 4. 

Remark 3.3.5. Non-regularity of a triangulation can also be checked using Theorem 8.3 
in |269j . The theorem, which is proved using linear program duality, says that the regular 
triangulations of A are the initial complexes of the toric ideal (To show that a triangu- 
lation is non-regular, one can show that the triangulation A^(Ia) is not T ■ See [269] for 
details.) 

3.4. Many Hirsch-sharp transportation polytopes 



We already have seen the results of [133J, [159J, and [160J, summarized in Theorem 2.2.6 
which says that there are "many Hirsch-sharp polytopes." That is to say, there are Hirsch- 
sharp d-polytopes with n > d facets whenever d > 7. Does this remain true if we only look 
at classical and 3-way axial transportation polytopes? In this section, we prove that there 
are many Hirsch-sharp transportation polytopes. Recall that a polytope is Hirsch-sharp if it 
meets the Hirsch Conjecture with equality. 



First, we prove a "permuted version" of Lemma 1.4.4 It says that the northwest 
corner rule algorithm can be done out of order by specifying permutations of the rows and 
columns. See Exercise 17 in Chapter 6 of |288j . To make the proof easier, we prove it for 
non-degenerate transportation polytopes. 

Lemma 3.4.1. Let the marginals u 6 M> and v £ M> define a non-degenerate p x q classical 
transportation polytope P 7^ 0. Let a be a permutation of [p] and t be a permutation of [q\. 
Then the polytope P has a vertex x with aVfoWg) > ^- 
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Proof. We construct a point x £ P using the following modified form of the well- 
known northwest corner rule algorithm. Let a be any permutation on [p] and let r be any 
permutation of [q]. 

Let x a i p ),T(q) = mm i u a(p) , v r(q)} ■ If the minimum is obtained at u a ^, set x a r p \j = 
for all jf ^ r(q) and replace t> T (g) with v T i q -\ — u a ( p y The rest of the point x = {xij) is 
obtained recursively as a point in a (p — 1) x </ classical transportation polytope. Similarly, 
if the minimum is obtained at i> r ( ? ), set £i, r (g) = f° r an * 7^ an d replace u a ^ with 
w CT (p) — v T (q)- The rest of the point x is obtained as a point in a p x (g — 1) transportation 
polytope. □ 



Note that this generalizes Lemma 1.4.4, which is the special case when a and r are both 
their respective identity permutations. 



£1,1 


£1,2 


£1,3 


£1,4 


£1,5 


£2,1 


£2,2 


£2,3 


£2,4 


£2,5 


£3,1 


£3,2 


£3,3 


£3,4 


£3,5 



Figure 3.6. Layout of the coordinates for the generalized Birkhoff polytope 
of size 3x5. 



Example 3.4.2. As an example, let P be the 3x5 generalized Birkhoff polytope. That is to 



say, we are looking at the polytope whose variables are Xij arranged according to Figure 3.6 
satisfying the summation conditions that each row must sum to 5 and each column must 
add up to 3. Let a be the permutation on [3] = {1, 2, 3} 



(3.3) 

and Jet r be the permutation on [5] 
(3.4) 





To Gil in the table shown in Figure \3^6\ according to the proof of Lemma 3.4.1, we start 
with the (a(p),r(q)) = (cr(3),r(5)) = (1,4) entry of the table. This entry is highlighted 
in boldface in Figure \3.7\ What is the largest entry that can fit here? Since the rows 
must sum to 5 and the columns must sum to 3, the largest possible value here is 3. 
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Xl,l 


X\,2 


Xl,3 


Xl,4 


a?i,5 


X2,l 


X2,2 


X2,3 


X2,i 


^2,5 


X3,l 


X3,2 


X3,3 


X3A 


£3,5 



Figure 3.7. The first entry to fill in is x 1,4. 

Then, since the minimum of min{u a u,Y T tq\} was obtained at v T ( q ), the rest of our point is 
found as a point in a 3 x 4 transportation polytope. The next entry we examine is the 
{cr(p),T(q — 1)) = (<r(3),r(4)) = (1,3) entry (which we can see has to be the minimum of 
5 — 3 = 2 and 3) of our 3x5 table. When the whole algorithm completes, we obtain the 



vertex shown in Figure 3.8 Compare this vertex to the one obtained by using the identity 
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CO 





2 
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1 









Figure 3.8. The vertex obtained by using a and r. 



permutations. See Figure 3.9 



CO 
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CO 
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CO 



Figure 3.9. The vertex obtained by the normal northwest corner rule algorithm. 



Theorem 3.4.3. Let p and q be relatively prime with min{p, q} > 3 and q > 2p. Then the 
p x q generalized classical Birkhoff polytope P is Hirsch-sharp. 

Proof. Let P be the p x q generalized classical Birkhoff polytope. Then the dimension 
of P is (p — l)(q — 1) and the number of facets is bounded above by pq. By Theorem 3.1 
in Chapter 6 of |288j . the polytope P has exactly pq facets. Thus, the conjectured Hirsch 
bound for the diameter of P is p + q — 1. 

We now construct two vertices v and v' of P whose distance is at least p + q — 1. Let 
v be the vertex obtained from Lemma |3.4.1 using the identity permutations for a and r. 
Let a' be the identity permutation on \p\. If g is odd, let r' be the permutation of [q] that 
sends 1, . . . , q (respectively) to [|] , . . . , q, 1, . . . , |_§ J (respectively). If q is even, let r' be the 
permutation that sends l,...,q (respectively) to | + 1, . . . ,q,l,. (respectively). Let v' 



be the vertex obtained from Lemma 3.4.1 using the permutations a' and t' . 
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Since p and q are coprime, the transportation polytope P is non-degenerate. Now we 
note that the supports of v and v' are disjoint (which is easy to see since q > 2p). Thus, 
the distance between the vertices v and v' is at least |supp(«)| = |supp(u')|, which (by 



Corollary 1.4.7) is p + q — 1 since P is non-degenerate. □ 



Remark 3.4.4. The condition q > 2p can be dropped in Theorem 3.4.3 but the proof 
becomes more difficult. (The definition for the permutations a and r need to be given with 
more caution.) The case of what occurs when p and q are close to each other is clear by 



example: see Figures 3.8 and 3.9 for the 3x5 case. The two vertices shown in these figures 
have disjoint support. Thus, in this example, we see that the diameter of the generalized 
3x5 Birkhoff polytope is at least 7. So, we have constructed a Hirsch-sharp 8-polytope 
with 15 facets. 

3.5. The diameter of p x 2 transportation polytopes 

In this section, we show that the Hirsch Conjecture holds for p x 2 transportation polytopes. 

Theorem 3.5.1. Let P ^ be a classical transportation polytope of size p x 2 with n < 2p 
facets. Then, the dimension of P is d = p — 2 and the diameter of P is at most n — d. 

To prove this theorem, we note that the coordinate-erasing projection of P to the 
coordinates xn,X2 i, . ■ . ,x p i of the first column shows that P is the intersection of a 
hyperplane with a rectangular prism. (In particular, if the intervals are all equal and one has 
a cube, then the Minkowski sum of two consecutive hypersimplices D(p,i) and D(p,i + 1) 
can be realized as a transportation polytope of size p X 2.) After an affine transformation, 
the polytope P is the intersection of a hyperplane and a cube. (The transformation takes 
the cube [0, ui] x ■ • ■ x [0, u p ] to the cube [0, l] p . That is to say, the ith coordinate m in 



the cube [0, l] p is xn/ui.) Thus, to prove Theorem 3.5.1, we will prove that the Hirsch 
Conjecture holds for polytopes obtained as the intersection of a cube and a hyperplane. 

Fix a dimension d € N. Let H = {x G M. d | a\x\ + • • • + adXd = b} be the hyperplane 
determined by the non-zero normal vector a = (ai, . . . , ad) and the constant b G M. Let 
denote d-dimensional cube with 0-1 vertices. Then, let P denote the polytope obtained as 
their intersection P = □<£ n H. 
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If the dimension of the polytope P is less than d — 1, then P is a face of D^. In that 
case, P itself is a cube of lower dimension, so we assume that the polytope P is of dimension 
d — 1. We may also assume that the polytope P is not a facet of the ci-cube, so that H 
intersects the relative interior of D^. 

Without assuming any genericity, a simple dimension argument shows that the vertices 
of the polytope P are either on the relative interior of an edge of the cube or are vertices 
of the cube. We assume that H is sufficiently generic. Then, no vertex of the cube will 
be a vertex of P. For each vertex v of P, we define its side signature cr(v) to be a string of 
length d consisting of the characters *, 0, and 1 by the following rule: 



(3.5) a(v)i 



if Vi = 0, 

1 if V{ = 1, 

* if < Vi < 1. 



By genericity, it cannot be the case that there are two vertices of P with the same side 
signature. Indeed, if there were two distinct vertices v and w with the same side signature, 
then P will contain the entire edge of the cube containing them both, and v and uu will not 
be vertices. 

Let Hifi denote the hyperplane {x £ M. d \ Xi = 0} and let Hi\ denote the hyperplane 
{x 6 M d | Xi = 1}. If there is an i G [d] such that the hyperplane H does not intersect Hp 
nor Hi i, then we can project P to a lower-dimensional face of Id- Thus, for each i € [d], we 
can assume that H intersects at least one of Hi^ or H^i. 

Given two vertices v = (yi, . . . , Vd) and w = (w±, . . . , Wd) of P = Id n H , we define the 
Hamming distance between them based on their side signatures: 

d 

(3.6) hamm(t),u)) = hamm((j(t))i, a(w)j), 

i=i 

where 

hamm(0, 1) = hamm(l, 0) = hamm(l, *) = hamm(*, 1) = hamm(0, *) = hamm(*, 0) = 1 



and 



hamm(0, 0) = hamm(l, 1) = hamm(*, *) = 0. 
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Lemma 3.5.2. Let P defined as above using a sufficiently-generic hyperplane H. Let v and 
w he two vertices of P. Let f(P) denote the number of facets of P. 

If v and w have the * in the same coordinate and P does not intersect either of 
the two facets in that direction, i.e., there is an i such that <y{v)i = * = cr(w)i and 
P n H lA = = P n H ijQ , then f(P) > (d - 1) + hamm(t), w) - 1. 

Otherwise, f(P) > (d— 1) + hamm(t),w). 

Proof. Without loss of generality, we can assume that the side signature cr(v) of the 
vertex v is (*, 0, 0, . . . , 0) and that the side signature a(w) of the vertex w is either of the form 
(0, *, 0, 0, . . . , 0, 1, 1, . . . , 1) with k > trailing ones or of the form (*, 0, 0, . . . , 0, 1, 1, . . . , 1) 
with k > 1 trailing ones, after applying a suitable rotation to the cube. 

In the first case, h&mm.(v,w) = k + 1 and we have at least d "0-facets" and k "1-facets." 

In the second case, we have d — 1 "0-facets" , k "1-facets" and (unless there is an i 
such that a{v)i = * = a(w)i and P n = = P n i?i,o) at least one more facet. Thus, 
f{P) > (d — 1) + k + 1 = (d — 1) + hamm(t>, w), unless we are in the special case, in which 
case f(P) > (d - 1) + k + 1 - 1. □ 

Lemma 3.5.3. Let P defined as above using a sufficiently-generic hyperplane H. Let v 
and w be two vertices of P. Then, there is a pivot from the vertex v to a vertex v' with 
hamm(f', w) = hamm(u, w) — 1. 

Proof. Without loss of generality, we can assume that a(v) = (*, 0, 0, . . . , 0) and that 
a(w) is either (*, 1, 1, . . . , 1) or (1, 1, . . . , 1, *). 

If the side signature a(w) of w is (*, 1, 1, . . . , 1), performing a pivot on the vertex v in 
any one of the d — 1 last coordinates reduces the Hamming distance. 

Otherwise, the side signature a(w) of w is (1, 1, . . . , 1, *). We now describe what can 
occur when pivoting from the vertex v to a new vertex v'. We claim that at least one of 
the d — 1 possible pivots on the vertex v does not put a in the first coordinate of the side 
signature cr(v') of the new vertex v' . Otherwise, the hyperplane H cuts the polytope P as a 
vertex figure: that is to say, the polytope P cuts the corner (1, 0, . . . , 0) of the cube. See 



Figure [3.10| for a picture. 

The remaining kind of pivots on v that result in a new vertex v' give side signatures 
<r(v') of one of the following three forms: 
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Figure 3.10. The vertex v on the horizontal axis (the x\ coordinate increases 
moving to the right) and its neighboring vertices on orthogonal edges of the 
cube. 

(1) The signature cr(v') of the neighbor v' of the vertex v could be 

(1,0,0,. ..,*,0,...,0), 

which reduces the Hamming distance by one. 

(2) The signature cr(v') of the neighbor v' of the vertex v could be 

(*,0,0,... ,0,1,0,0,. ..,0), 

which reduces the Hamming distance by one. 

(3) Otherwise, one remaining pivot could give the side signature 

(1,0,0,. ..,0,*) 

for a(v'). 

This third type of pivot does not reduce the Hamming distance. But if this is the only pivot 
that could give this and the first two kinds of pivots cannot be performed, then all of the 
remaining pivots are the kind that put in the first coordinate of the side signature cr(v) of 
v. But this would imply that H could not have intersected the hyperplane Hd + , and thus P 
would be a (d — l)-cube with one vertex truncated. □ 

By applying an affine transformation to P = n H, we obtain a p x 2 classical 
transportation polytope, thus: 

Corollary 3.5.4. The Hirsch Conjecture holds for p x 2 classical transportation polytopes. 

Open Problem 3.5.5. Prove that the Hirsch Conjecture holds for p x 3 classical trans- 
portation polytopes. 
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Open Problem 3.5.6. Prove that the Hirsch Conjecture holds for polytopes P of the form 
□d n H\ n H2, where is a d-dimensional cube and Hi and Hi are hyperplanes. 

3.6. The diameter of 3-way transportation polytopes defined 
by 1-marginals 



In this section, we prove Theorem 3.0.3, which gives the first quadratic upper bound on 
the diameter of all 3-way p x q x s transportation polytopes defined by 1-marginals. The 
contents of this section is joint work with De Loera, Onn, and Santos. 

Here we consider a 3-way pxqxs transportation polytope P defined by certain 1-marginal 
vectors u, v, and w. Recall that for bounding its diameter there is no loss of generality 
in assuming P non-degenerate, that is, that u, v and w are sufficiently generic. In the 
non-degenerate case, at every vertex V of our polytope, exactly pqs — (p + q + s — 2) variables 
are zero, and exactly p + q + s — 2 are non-zero. As in the case of classical transportation 
polytopes, the set of triplets k) indexing non-zero variables will be called the support of 
the vertex V, denoted supp(V). 

We say that a vertex V of the axial transportation polytope P is well-ordered if the 
triplets (i,j,k) that form its support are totally ordered with respect to the following 
coordinate- wise partial order: 



(3.7) (i,j,k) < (i',j',k f ) if i < i , and j < j , and k < k . 

Observe that a set of p + q + s — 2 triplets satisfying this must contain exactly one triplet 
k) with i + j + k = z for each z = 3, . . . ,p + q + s. Actually, supports of well-ordered 
vertices are the monotone staircases from (1,1,1) to (p, q, s) in the pxqxs grid. 

Lemma 3.6.1. If x, y and z are generic, then the axial transportation polytope P has a 
unique well-ordered vertex V . 

Proof. Existence is guaranteed by the "northwest corner rule algorithm", which fills 
the entries of the table in the prescribed order (see the survey [238j or Exercise 17 in Chapter 
6 of |288j ). More explicitly: let V Pt q :S = min{x p , y q , z s }. Genericity implies that the three 
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values x p , y q and z s are different. Without loss of generality we assume that the minimum 
is z s . Then, our choice of V Pt q tS makes Vij tS = for every other pair The rest of 

our vertex V is a vertex of the pxgx(s-l) axial transportation polytope with margins 
u' = (iti, . . .,u p -i,ui - w s ), v' = (vi, . . .,v g -i,v q - w s ), and w' = (wi, . . . , w n -i). 

Uniqueness follows from the same argument, simply noticing that the support of a 
well-ordered vertex always contains the entry (p, q, s), and no other entry from one of the 
three planes (p, *, *), (*, q, *) and (*, *, s). This, recursively, implies that the vertex can be 
obtained by the northwest corner rule. □ 



Remark 3.6.2. The proof of Lemma 3.6.1 above is the 3-way analogue of Lemma 1.4.4 
Another proof of Lemma 3.6.1 can be obtained using the formalism of chambers developed 
in the previous sections: it is obvious (and is proved in |96j ) that if c denotes a chamber of 
A, and T is a triangulation of cone(A), then there is a unique maximal-dimension simplex 



in T that contains c. Thus, Lemma 3.6.1 follows from the fact that monotone staircases in 
the p x q x s grid form a triangulation of the vector configuration A p ^ tS of axial p x q x s 
transportation polytopes. The latter is well-known, once we observe that A p ^ s is the vertex 
set of a product of three simplices. The triangulation in question is called the "staircase 
triangulation" of it (see Section 2.3 in Chapter 6 of [105JJ. 



Example 3.6.3. To illustrate Lemma 3.6.1 consider the non-degenerate 3x3x3 axiai 



transportation polytope the axial transportation polytope P with: 





= 112 




-- 18 


Y X3 >i,k = 30 
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= 40 


Y Xi > 2 > k = 
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Y Xi > 3 > k = 114 
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i,k 


Y2 Xi 'i' 1 


= 82 


Yl Xi ^ 2 = 


44 


Y Xi J\3 = 34 






i,3 







The unique well-ordered vertex V of the axial transportation polytope P has the non-zero 
coordinates £(1,1,1) = 40, £(i )2 ,i) = 6, £(1,3,1) = 3 6, £(1,3,2) = 3 0, £(2,3,2) = 14, £(2,3,3) = 4 , 
and £(3,3,3) = 30. Note that the non-zero entries ofV are totally ordered (they are presented 



above in increasing order) with respect to (3.7). Figure 3.11 depicts the associated monotone 
staircase. 
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Figure 3.11. A well-ordered vertex and its staircase. 



Our bound on the diameter of the axial transportation polytope P is based on an explicit 
path that goes from any initial vertex V of the axial transportation polytope P to the unique 
well-ordered vertex V . To build this path we rely on the following stratified version of the 
concept of well-ordered vertex. We say that a vertex V of the axial transportation polytope 
P is well- ordered starting at level z, where z is an integer between 3 and p + q + s if: 

(1) For each y = z, . . . ,p + q + s, the support of V contains exactly one triplet k) 
with i + j + k = y. 



(2) Those triplets are well-ordered. (The partial order given in (3.7) is a total order on 
these triplets.) 

(3) All other triplets in the support have entries which are index-wise smaller than or 
equal to those of the unique triplet (io,jo, ko) with iq + jo + = z. 

For example, the only vertex "well-ordered starting at level 3" is the well-ordered vertex 
V . Slightly less trivially, it is also the unique vertex "well-ordered starting at level 4." On 
the other extreme, all vertices that contain (p, q, s) as a support triplet are well-ordered 
starting at p + q + s. Observe that from any vertex of the axial transportation polytope P 
we can move, by a single pivot edge in the sense of the simplex method, to another vertex 
containing any prescribed entry (i,j,k) to be non-zero. In particular, we can move to a 
vertex that has (p, q, s) in its support. So, we can assume from the beginning that (p, q, s) is 
in the support of our initial vertex V, and will add one to the count of edges traversed to 
arrive to V. 

Lemma 3.6.4. IfV is a vertex of the axial transportation polytope P that is well-ordered 
starting at level z £ {5, . . . ,p + q + s}, then there is a path of at most 2(z — 4) edges of the 
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axial transportation polytope P that leads from V to a vertex that is well-ordered starting 
at level z — 1. 

Proof. Let (io,jo, ko) be the unique triplet in the support of V with iq + jo + k$ = p. 
We first observe that there is no loss of generality in assuming that z = p + q + s (that is, 
(*0) JO; ko) = (p, q, s)). This is because the vertices of the axial transportation polytope P 
that are well-ordered starting at level z and agree with V in all the triplets with sum of 
indices greater than or equal to z are the vertices of a non-degenerate i$ x jo x k$ axial 



transportation polytope, obtained as in the proof of Lemma 3.6.1 

So, from now on we assume that V is well-ordered starting at level z = p + q + s. Let 
Si be the set of support triplets in V, other than (p, q, s), that have first index equal to I. 
Similarly, let 52 and £3 be the sets of support triplets that have, respectively, second and 
third indices equal to m and n. 

Our goal is to modify V until one of Si , S2 or S3 becomes empty, but always keeping the 
triplet (p, q, s) in the support. Once this is done, a single pivot step can be used to obtain 
a vertex that is well-ordered starting at (p, q, s) — 1 as follows: Without loss of generality 
assume that Si is empty (the cases when S2 or 53 are empty are treated identically). In 
particular, neither (p, q — 1, s) nor (p, q, s — 1) are in the support. If (p — 1, q, s) is in the 
support then our vertex is already well-ordered starting at p + q + s — 1. If not, we do 
the pivot step that inserts (p — l,q,s). This pivot step cannot remove (p,q,s) or insert 
(p, q — 1, s) or (p, q, s — 1) in the support. (The (p, q, s) coordinate is not removed from the 
support since the entry remains constant in this pivot. The (p, q — l,s) and (p,q,s — 1) 
coordinates remain zero because only non-zero entries of V and the entry (p — l,q,s) change 
in the pivot). This pivot produces a vertex well-ordered starting at p + q + s — 1. Figure 
3.12| gives a picture for this case. 




{p- l,9.s) 
(p, q,s-l 



Figure 3.12. A well-ordered vertex starting at p + q + s — 1. 
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Given a vertex V well-ordered at p + q + s, we specify a sequence of pivots in the graph 
of the axial transportation polytope P to a vertex V such that one of Si, S% or S3 is empty 



for V' . Lemma 3.6.5 below shows how to get such a V in a number of steps bounded above 
by 

2|Si U S 2 U S 3 1 - 3 < 2{z - 3) - 3 = 2z - 9. 

In one more step, that is, at most 2z — 8, we get to a vertex that is well-ordered starting at 
p + q + s — 1. This completes the proof of our lemma. □ 



For Lemma 3.6.5 let us introduce the following notation: 



R 1 :=S 1 \(S 2 US 3 ), R 2 := S 2 \ (Si U S 3 ), R 3 := S 3 \ (Si U S 2 ), 

R12 ■= Si n S2, i?i3 := Si n S3, R23 '.= S2 n S3. 

That is, Ri consists of the elements of Si U S2 U S3 that belong only to Si, and Rij of those 
that belong to S, and Sj. Observe that, by definition, no element belongs to the three Sj's, 
so that Si U S2 U S3 is the disjoint union of these six subsets. 

Lemma 3.6.5. With the above notation and the conditions of the proof of the previous 
lemma, suppose that no Si is empty. Then: 

(1) If both Ri and Rjk are non-empty, with {i,j, k} = {1, 2, 3}, then there is a single 
pivot step that decreases \ Si U S2 U S3 1 . 

(2) If the three R^ 's are non-empty, then there is a sequence of two pivot steps that 
decreases | Si U S2 U S3 1 . 

(3) If the three Ri 's are non-empty, then there is a sequence of two pivot steps that 
decreases |Si U S2 U S3I. 

(4) If none of the above happens, then Si U S 2 U S3 is contained in one of the Si 's, say 
Si. Then, there is a sequence of\Si \ — 1 pivot steps that makes Si U S2 U S3 empty. 

All in all, there is a sequence of at most 2\Si U S2 U S3I — 3 pivot steps that makes some Si 
empty. 

Proof. Let us first show how the conclusion is obtained. We argue by induction on 
I Si U S2 U S3 1, the base case being |Si U S2 U S3I =2, which is the minimum to have no Sj 
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empty. The base case implies we are in the situation of either part (1) or part (4), and a 
single pivot step makes an Si empty. 

If | Si U S2 U S3 1 > 2 and one of the conditions (1), (2) or (3) holds, then we do the 
step or the two pivot steps mentioned there and apply induction. If none of these three 
conditions hold then it is easy to see that (4) must hold. (Remember that we are assuming 
that no Si is empty, and Si = R4U Rij U Rik)- Part (4) guarantees we have a sequence of 
I Si I — 1 < 2IS1US2US3I — 3 pivot steps that makes an Sj empty. 

So, let us prove each of the four items in the lemma. Let a denote the entry (p, q, s). 

(1) Suppose without loss of generality that R12 and R3 are not empty. Let (3 £ R\2 and 



7 £ -R3. The reader may find it useful to follow our proof using Figure 3.13 which de- 



picts the situation. Observe that a = (p, q, s) is the index-wise maximum of (3 and 7. 




Figure 3.13. A layout of entries a, /3, 7 and 5. 



Let S be the index-wise minimum of them. First observe that 5 is not in the support 
of vertex V . Otherwise, we could add ±| min{x Q , xp, x 7 , x$} (e a + eg — ep — e 7 ) to 
V and stay in the axial transportation polytope P. Hence, V would be a convex 
combination of two other points from the axial transportation polytope P (and 
thus not a vertex), parallel to the direction of e a + e$ — ep — e 7 . (Here ei jk denotes 



the basis unit vector in the direction of the variable Xijfc.) By Lemma 1.0.10 the 
feasible point V in P would not be a vertex. Therefore, the triplet d is not in the 
support of the vertex V. 

Next, consider V' = V + min{xp, x 7 } (e a + e$ — ep — e 7 ). Observe that V' has 
different support than V since either f3 or 7 has been removed (not both, because 
xp / x-y by non-degeneracy). Also, since V cannot have support strictly contained 
in that of V, 5 must have been added. That is, the supports of the vertices V and 
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V differ in the deletion and insertion of a single element, which means they are 
adjacent in the graph of the polytope the axial transportation polytope P. As 
desired, when going from V to V the cardinality of 5i U 52 U S3 is decreased by 
one. 



Example 3.6.6 (Example 3.6.3 continued). Consider the vertex V with non-zero 

coordinates £(1,1,2) = 28, £(2,1,2) = 12 > ^(2,2,3) = 6 ; x (i,3,2) = 2 , £(1,3,1) = 82 > 
£(3,3,2) = 2 , and £(3,3.3) = 28. In this example, a = (3,3,3), j3 = (3,3,2), 7 = 
(2, 2, 3), and 5 = (2, 2, 2). After clearing xr, we arrive at the vertex V' with non- 

ZGTO COOrdlTlcbtjGS X ^ 2^ — 28j *^ (J2 1 2) — *^(2 2 3) — 3 2) — 3 1) — §2j 

X (2,2,2) = 2 > and X (3,3,3) = 28 - 

(2) Suppose now that none of the Ri^s is empty, and let (3 G -R13, 7 6 -R23 and 5' G -R12. 
we apply the same pivot as in case one, which makes 5, the coordinate- wise minimum 
of P and 7, enter the support. This pivot does not decrease |5i U 52 U S3I, but it 
leads to a situation where we have 5 S R3 and 5' S i?i2- Hence, we can apply part 
one and decrease |Si U S2 U 53 1 with a second step. 

(4) Let us prove now part (4) and leave (3), which is more complicated, for the end. 
Observe that if S\ U 52 U S3 = 5i but 52 and 53 are not empty, then necessarily i?i2 
and -R13 are both non-empty. While this holds, we can do the same pivot steps as 
before with a (3 G R12 and a 7 G -R13. Each step decreases by one the cardinality of 
-R12 U -R13, increasing the cardinality of R\. The process finishes when R\2 (hence 
52) or i?i3 (hence 53) becomes empty, which happens, in the worst case, in |5i| — 1 
steps. 

(3) Finally, consider the case where the three R^s are non-empty. Let (3 = {p,ji, k\) £ 
Ri, 7 = {i2,q, ^2) £ R2 and 5 = («3,j3,s) G R3, and, as before, a = (p,q,s). 
Figure 3,14| depicts the situation to help the reader with following our proof. Let 



ei and €2 be two triplets of indices with the property that {a, ei, £2} and {f3, 7, 5} 
use exactly the same three first indices, the same three second indices, and the 
same three third indices. For example, let us make the choice ei = (i2,ji,k\) and 



£2 = (^3, J3) ^2) as in the left part of Figure 3.14 By non-degeneracy, the smallest 



value among xp, x 7 and x$ at V is unique. We assume, without loss of generality, 
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Figure 3.14. Possible layouts of the entries a, /3, 7, 5, e± and ei- 



that the smallest among them is xr. Let W be the point of the axial transportation 
polytope P obtained by changing the following six coordinates: 



/ 


— Xa 


+ X/3, 




= X/3 


-Xj3=Q 


J 

X ^ 




- X/3, 


x 's 


= Xg 


~ X/3, 


< 


Xe± 


+ xp, 


< 


X €2 


+ Xp. 



It may occur that e± = 62, as shown in the right side of Figure 3.14 Then we do 
the same pivot except we increase the corresponding entry x ei = x €2 twice as much. 

Observe that one of e\ or £2 may already be in the support of V, but not 
both: Otherwise W would have support strictly contained in that of V, which is 
impossible because V is a vertex and has minimal support. If one of e\ or €2 were 
already in the support of V, or if e± = €2, then W is a vertex and we take V' = W. 
As in the first case, V' is obtained from V by traversing a single edge and has one 
less support element in S\ U S2 U S3 than V: None of e\ or £2 can have a common 
entry with a, since none of /3, 7 and 5 has two common entries with a. 

However, if e\ and €2 are different and none of them was in V, then W has 
one-too-many elements in its support to be a vertex, which means it is in the 



relative interior of an edge E and L = VW is not an edge. See Figure 3.15 
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Moreover, both E and VW lie in a two-dimensional face F. This is so because 
every support containing the support of a vertex defines a face of dimension 
equal the excess of elements it has. In our case, F is the face with support 
support(V) U support (W) = support (V) U {ei, €2}- 



E 




E 

Figure 3.15. The octagon containing segment VW arising when entries e\ 
and €2 are different and none of them was in the support of V 

We now look more closely at the structure of F. Each edge H of F is the 
intersection of F with a facet of our transportation polytope. That is, there is a 
unique variable rj that is constantly zero along H but not zero as we move on F in 
other directions. For example, since e\ and €2 are zero at V but not constant on F 
(they increase along L), V is the common end of the edges defined by ei and €2- 

Our goal is to show that there is a vertex V 1 of F at distance at most two from 
V and incident to the edge defined by one of the variables j3, 7, and 5. At such a 
vertex V we will have decreased | S\ U S2 U 5*3 1 by one, as claimed. The key remark 
is that there are at most two edges of F not produced by one of the variables a, 
[3, 7, 5, e±, and 62- every variable r/ other than those six is constant along L, so it 
either produces an edge parallel to L or no edge at all. In particular, F is at most 
an octagon, as in Figure |3.15| Now: 

• If F has five or less edges, then every vertex of F is at distance one or two 
from V. Take as V either end of the end-point W of L. This works because 
at W one of /3, 7 or 5 is zero, by construction. 

• If F has six or more edges, then the two vertices V and V" of F at distance 
two from V are at distance at least two from each other. So, together they 
are incident to four different edges, none of which is the edge of ei or €2- 
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(Remember that the edges of ei and €2 are incident to V). At least one of 
these four edges is defined by (3, 7, or 5, because there are (at most) three 
other possible edges: the one of a and two parallel to L. 

In either case, at most two edges are needed to go from V to a vertex V' incident 

to an edge defined by one of the variables (3, 7, or S. 

Thus, there is a sequence of at most 2|Si U S2 U S3I — 3 pivot steps that makes at least one 
of Si, S2, or S3 become empty. □ 

Example 3.6.7. To make ideas completely clear, using the same polytope the axial trans- 



portation polytope P as in Example 3.6.3, we consider its vertex V with non-zero coordinates 

v (i,i,3) = 25, «(3,i,i) = 15 > ^(3,2,1) = 6, U(i,3,i) = 61, v (i,3,2) = 26, f( 2 ,3,2) = 18 and ^(3,3,3) = 9. 
Here, a = (3,3,3), p = (3,2,1), 7 = (2,3,2), 5 = (1,1,3), a = (2,2,1), and e 2 = (1,1,2). 
The triplet (3 is not in the support of W, and W has non-zero coordinates 1^(1,1,3) = 19, 
w (i,i,2) = 6, u>(3,i,i) = 15, ^(2,2,1) = 6, ^(1,3,1) = 61, u?(i,3,2) = 26, ^(2,3,2) = 12, and 

w (3,3,3) = 15- 

The vertices of the axial transportation polytope P with support contained in support(V)D 
{ei, €2} form the 4-gon F = VBCD where B is the vertex with non-zero coordinates 

^(1,1,3) = 12,6(1,1,2) = 26, 6(3,1,1) = 2, 6(3,2,1) = 6, 6( 3)3i 3) = 22,6(2,3,2) = l 8 i 6(1,3,1) = 74, 
C is the vertex with non-zero coordinates 

c (l,l,3) = 22,C( 3)ljl ) = 18,C( 2 ,2,1) = 6, C( 3)3 ,3) = 12, C(i j3(2 ) = 32, C(2, 3|2 ) = 12, C( 1)3>1 ) = 58 

and D is the vertex with non-zero coordinates 

d (l,l,3) = 6,^(1,1,2) = 32, d(3,l,l) = 2, d( 2 ,2,l) = 6, ^(3,3,3) = 28, ^(2,3,2) = 12,^(1,3,1) = 74. 

Note W is in the edge E = CD. We let V = D, the endpoint of E closer to V. Thus, we 
use one edge to go from V to V . 

Proof of Theorem I3.Q.3L Starting with any vertex "well-ordered starting at z = 



p + q + s" (which can be reached in a single step) we use Lemma 3.6.4 to decrease one unit 
by one the level at which our vertex starts to be well-ordered until we reach the unique 
well-ordered vertex V. Thus, the number of steps needed to go from an arbitrary vertex V 
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to V is at most 

2-4 



l + £2( y -4) = l + 2X> = l + 2(\ 3 ) 

y=5 ?^=1 



<(z-3f. 

To go from one arbitrary vertex to another, twice as many steps suffice. □ 

3.7. Subpolytopes of transportation polytopes 

In this section, we study network flow polytopes. Network flow polytopes are faces of classical 
transportation polytopes (see [15| . |130j . and [132] ). We will prove an upper bound on the 
diameter of two subclasses of network flow polytopes. 



Let G = (N, A) be a directed acyclic graph with n nodes and m arcs. (See Figure 3.16 
Each node i G N is assigned a number b(i) which indicates its supply or demand. An arc 
G A indicates a directed edge that goes from the node i to the node j. The network 
flow polytopes come in many varieties. Here, we describe two specific kinds of network flow 
polytopes. (See |6j.) 




The singly- capacitated network flow poly tope P determined by u, I, b, and the acyclic 
directed graph G is the polytope in the m variables 

> for 6 A 

subject to the n equations 



{j'l(vj')eA} {jUi)eA} 
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Remark 3.7.1. We require that the directed graph G is acyclic. If G had a directed cycle, 
then we could add any positive quantity e > to each edge in a cycle, and thus P would be 
an unbounded polyhedron. 

We can, thus, assume that the nodes are labeled from 1 to n and that directed edges 
from i to j always have the property that i > j. 

In the doubly-capacitated version, each arc £ A also has a capacity upper bound 
Uij and a lower bound lij. The doubly- capacitated network flow polytope P determined by 
G, u, I and b is the polytope in the m variables 

Xij S [kj, Uij] for E A 

subject to the n equations 

0'I(*J)6A} O'l(j'.0eU} 

In [230J, Orlin proved the following bound on the diameter of all doubly-capacitated 
network flow polytopes. 

Theorem 3.7.2 (Orlin [230J). The graph of every doubly-capacitated network How polytope 
with n nodes and m arcs has diameter 0(mnlogn). 

Definition 3.7.3. Fix n £ N and 1 < k < n an integer. Let G n ^ = (V, E) be the directed 
graph on the n nodes labeled V = {1, . . . ,n} with the directed edge set E = € 
[n] x [n] : 1 < j — i < k}. 

If the Hirsch Conjecture is true, then the diameter of the singly-capacitated network 
flow polytope for G n ± with any supply vector b is no more than \E\ — (\V\ — 1). Here, we 
show that the diameter is no more than twice that value. We prove a new linear bound on 
the diameters of the polytopes given by the graphs G n ^- 

Theorem 3.7.4. The diameter of the singly-capacitated network flow polytope P of G U} k 
with any supply vector b is at most 2\E\. 

Proof. Let P be the singly-capacitated network flow polytope corresponding to the 
directed graph G n) k- We assume that the polytope P is non-empty. Let x be any arbitrary 
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vertex of P. We will construct a sequence of pivots from the vertex x to the vertex x that 
contains all edges of the form j + 1 = i. 

Suppose the vertex x does not have the edge + Consider the minimum i such 
that the edge (i + 1, i) is not in the support graph of x. Since the support graph of x is a 
tree, adding the edge (i + in the support graph induces a directed cycle. We pivot x on 
that edge (i + which removes some other edge. (Moreover, the edge that is removed 
cannot be one of (2, 1), (3, 2), . . . , (i, i — 1) since these edges do not appear in the cycle.) The 
resulting vertex x' has the property that the edges of the form (2, 1), (3, 2), . . ., 1) 
belong in the support graph of x' . Continuing in this way, we reach the vertex that only has 
edges of the form (j, j — 1). 

Each edge is pivoted on at most once in going from the vertex x of P to the unique 
vertex x of P whose support graph corresponds to a directed path. Thus, the length of 
the path on the graph of P is bounded above by \E\. The distance between two arbitrary 
vertices in the polytope is at most twice that distance. □ 

More generally, we define the following subclass of directed acyclic graphs. 

Definition 3.7.5. A directed acyclic graph on the n nodes V = {1, . . . , n} is said to have 
the intermediate property if: 

• There is an edge from i + 1 to i for all i = 1, . . . , n — 1, 

• The node i is a sink of the G \ {1, ...,£— 1} for each i, and 

• If there is an edge from j to i — 1 in G, then there is an edge from j to i. 



See Figure 3.17 for an example. 



Figure 3.17. A directed acyclic graph that has the intermediate property: 
since the edge (6, 2) is present, the edges (6, 3) and (6, 4), which are indicated 
with dotted lines, must be present. 



As an extension of Theorem 3.7.4 we show that: 
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Theorem 3.7.6. Let G he any directed acyclic graph on n nodes and m edges with the 
intermediate property. Then, the diameter of the singly-capacitated network flow polytope 
P on G with any supply vector b is no more than 2m. 

Proof. Let x be any arbitrary vertex of the singly-capacitated network flow polytope 
P corresponding to the directed graph G with the intermediate property. We will construct 
a sequence of pivots from the vertex x to the vertex x that contains all edges (j, i) of the 
form i + 1 = j. 

Suppose the vertex x does not have the edge + Consider the minimum i such 
that the edge + is not in the support graph of x. Since the support graph of ir is a 
tree, adding the edge (i + 1, i) in the support graph induces a directed cycle, whose existence 
is guaranteed by the intermediate property. We pivot x on that edge {i + 1, i) which removes 
some other edge. (By the orientation, none of the edges of the form (2, 1), (3, 2), . . ., (i + 1, i) 
can be removed from this pivot.) The resulting vertex x' has the property that the edges of 
the form (2, 1), (3, 2), . . ., {i + 1, i) belong in the support graph of x' . Continuing in this 
way, we reach the vertex that only has edges of the form (j + 

Each edge is pivoted on at most once in going from the vertex x of P to the unique 
vertex x of P whose support graph corresponds to a directed path. Thus, the length of 
the path on the graph of P is bounded above by \E\. The distance between two arbitrary 
vertices in the polytope is at most twice that distance. □ 
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APPENDIX A 

A Catalogue of Transportation Polytopes 

This appendix presents a summary of the complete catalogue of transportation polytopes 
of small sizes using the software transportgen (see |176j ). Using computational tools, we 
give a complete catalogue of non-degenerate 2-way and 3-way transportation polytopes 
(both axial and planar) of small sizes. This allows us to explore properties of transportation 
polytopes (e.g., their diameters and how close they were to the Hirsch Conjecture bound). 



The summary of the catalogue is stated in Theorem |1.5.2 



The catalogue was obtained via the exhaustive and systematic computer enumeration 
of all combinatorial types of non-degenerate transportation polytopes. The theoretical 
foundations of it are the notions of parametric linear programming, chamber complex, Gale 



diagrams and secondary polytopes (see [105] ) are presented in Section 1.5 A more complete 



catalogue of transportation polytopes is available in a database on the web (see [175J). 

Table A.l: Possible diameters and numbers of vertices and 
facets for non-degenerate 2x3 classical transportation poly- 
topes 



Property 


Values 


Diameters 


12 3 


Num. vertices 


3 4 5 6 


Num. facets 


3 4 5 6 



Table A. 2: Possible diameters and numbers of vertices and 
facets for non-degenerate 2x4 classical transportation poly- 
topes 



Property 


Values 


Diameters 


12 3 4 


Num. vertices 


4 6 8 10 12 


Num. facets 


4 5 6 7 8 
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Table A. 3: Possible diameters and numbers of vertices and 
facets for non-degenerate 2x5 classical transportation poly- 
topes 



Property 


Values 


Diameters 


1 2 3 4 5 


Num. vertices 


5 8 11 12 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 


Num. facets 


5 6 7 8 9 10 



Table A. 4: Possible diameters and numbers of vertices and 
facets for non-degenerate 3x3 classical transportation poly- 
topes 



Property 


Values 


Diameters 


2 3 4 


Num. vertices 


9 12 15 18 


Num. facets 


6 7 8 9 



Table A. 5: Possible diameters and numbers of vertices and 
facets for non-degenerate 3x4 classical transportation poly- 
topes 



Property 


Values 


Diameters 


2 3 4 5 6 


Num. vertices 


16 21 24 26 27 29 31 32 34 36 37 39 40 41 42 44 45 46 48 49 
50 52 53 54 56 57 58 60 61 62 63 64 66 67 68 70 71 72 74 75 
76 78 80 84 90 96 


Num. facets 


8 9 10 11 12 



Table A. 6: Possible diameters and numbers of vertices and 
facets for non-degenerate 2x2x2 axial transportation poly- 
topes 



Property 


Values 


Diameters 


2 3 4 


Num. vertices 


8 11 14 


Num. facets 


6 7 8 
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Table A. 7: Possible diameters and numbers of vertices and 
facets for non-degenerate 2x2x3 classical transportation 
polytopes 



Property 


Values 


Diameters 


2 3 4 5 


Num. vertices 


18 24 30 32 36 38 40 42 44 46 48 50 52 54 56 58 60 62 64 66 
68 70 72 74 76 78 80 84 86 96 108 


Num. facets 


9 10 11 12 



Table A. 8: Possible diameters and numbers of vertices and 
facets for non-degenerate 2x2x3 planar transportation 
polytopes 



Property 


Values 


Diameters 


12 3 


Num. vertices 


3 4 5 6 


Num. facets 


3 4 5 6 



Table A. 9: Possible diameters and numbers of vertices and 
facets for non-degenerate 2x3x3 planar transportation 
polytopes 



Property 


Values 


Diameters 


3 4 5 6 


Num. vertices 


12 15 16 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 


Num. facets 


7 8 9 10 11 12 
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APPENDIX B 



A Triangulation of the Fourth Birkhoff Polytope 



In Section 3.3 we presented a collection T of simplices and claimed that they were the 



maximal simplices of a triangulation of a certain face F of the fourth Birkhoff polytope. In 



this appendix, we prove that the collection of simplices defined in Section 3.3 are indeed 
a polyhedral subdivision. (The fact that the polyhedral subdivision T is a triangulation 
follows immediately, since all polyhedra in T are simplices.) To prove this, we use the 
following characterization of polyhedral subdivisions presented as Theorem 4.5.9 in [105J. 

Theorem B.0.1. A non-empty set T of d-dimensional subconhgurations of a point configu- 
ration A in M rf is the set of maximal cells of a polyhedral subdivision of A if and only if it 
satisfies the following conditions: 

• For each facet G of a d-cell B in T, either G is contained in a facet of A or there is 
another d-cell B in T that contains G as a facet. 

• conv(,B) n conv(fi') = conv(£ n B') for all B, B' £ T. 

• For all label sets B, B' G T, B n B' is a face of both B and B' . 

Since the points X a , . . . , X n are in convex position, we simply need to check the following 
property: 

• For every facet G of every maximal simplex Oi in 7~, either: 

— G is contained in a facet of F, or 

— there is another maximal simplex Uj in T that contains G as a facet, with 
3 + i- 

Below, we verify this property for each of the eight facets of the 32 maximal simplices: 

The facets of the simplex o\ are bcdefgi, acdefgi, abdefgi, abcefgi, abcdfgi, abcdegi, 
abcdefi, and abcdefg. 

(1) The facet bcdefgi is contained in the facet of B4 defined by X3 t 3 = 0. 

(2) The facet acdefgi is also a facet of the simplex 03. 
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(3) The facet abdefgi is also a facet of the simplex o^- 

(4) The facet abcefgi is contained in the facet of B4 defined by 2:2,1 = 0. 

(5) The facet abcdfgi is also a facet of the simplex 1724. 

(6) The facet abcdegi is contained in the facet of B4 defined by £4,2 = 0. 

(7) The facet abcdefi is contained in the facet of B4 defined by 2:2,2 = 0. 

(8) The facet abcdefg is contained in the facet of B4 defined by £1,4 = 0. 

The facets of the simplex 02 are abdefgi, bdefghi, adefghi, abefghi, abdfghi, abdeghi, 
abdefhi, and abdefgh. 

(1) The facet abdefgi is also a facet of the simplex o\. 

(2) The facet bdefghi is contained in the facet of B4 defined by £3,3 = 0. 

(3) The facet adefghi is also a facet of the simplex 04. 

(4) The facet abefghi is contained in the facet of B4 defined by £4,3 = 0. 

(5) The facet abdfghi is also a facet of the simplex 02s- 

(6) The facet abdeghi is contained in the facet of B4 defined by 2:3,1 = 0. 

(7) The facet abdefhi is contained in the facet of B4 defined by 2:2,2 = 0. 

(8) The facet abdefgh is contained in the facet of B4 defined by 2:1,4 = 0. 

The facets of the simplex 03 are acdefgi, cdefgik, adefgik, acefgik, acdfgik, acdegik, 
acdefik, and acdefgk. 

(1) The facet acdefgi is also a facet of the simplex o\. 

(2) The facet cdefgik is contained in the facet of B4 defined by £3,3 = 0. 

(3) The facet adefgik is also a facet of the simplex 04. 

(4) The facet acefgik is contained in the facet of B4 defined by 2:2,1 = 0. 

(5) The facet acdfgik is also a facet of the simplex 024. 

(6) The facet acdegik is contained in the facet of B4 defined by 2:4,2 = 0. 

(7) The facet acdefik is also a facet of the simplex 0%. 

(8) The facet acdefgk is contained in the facet of B4 defined by 2:2,3 = 0. 

The facets of the simplex 0-4 are adefghi, adefgik, defghik, aefghik, adfghik, adeghik, 
adefhik, and adefghk. 

(1) The facet adefghi is also a facet of the simplex 02. 

(2) The facet adefgik is also a facet of the simplex 0-3. 
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(3) The facet defghik is contained in the facet of B4 defined by £3,3 = 0. 

(4) The facet aefghik is also a facet of the simplex u 5 . 

(5) The facet adfghik is also a facet of the simplex 028- 

(6) The facet adeghik is also a facet of the simplex ctq. 

(7) The facet adefhik is also a facet of the simplex 09. 

(8) The facet adefghk is contained in the facet of B4 defined by ^2,3 = 0. 

The facets of the simplex 5 are aefghik, efghikl, afghikl, aeghikl, aefhikl, aefgikl, 
aefghkl, and aefghil. 

(1) The facet aefghik is also a facet of the simplex a 4. 

(2) The facet efghikl is contained in the facet of B4 defined by xip = 0. 

(3) The facet afghikl is also a facet of the simplex 0-7. 

(4) The facet aeghikl is also a facet of the simplex o§. 

(5) The facet aefhikl is also a facet of the simplex o\2- 

(6) The facet aefgikl is contained in the facet of B4 defined by X2,i = 0. 

(7) The facet aefghkl is contained in the facet of B4 defined by #2,3 = 0. 

(8) The facet aefghil is contained in the facet of B4 defined by £4,3 = 0. 

The facets of the simplex cfq are adeghik, aeghikl, deghikl, adghikl, adehikl, adegikl, 
adeghkl, and adeghil. 

(1) The facet adeghik is also a facet of the simplex 04 . 

(2) The facet aeghikl is also a facet of the simplex 05. 

(3) The facet deghikl is also a facet of the simplex CJ13. 

(4) The facet adghikl is also a facet of the simplex 029. 

(5) The facet adehikl is also a facet of the simplex 014. 

(6) The facet adegikl is contained in the facet of B4 defined by 3^2 = 0. 

(7) The facet adeghkl is contained in the facet of B4 defined by X2,3 = 0. 

(8) The facet adeghil is contained in the facet of B4 defined by £3,1 = 0. 

The facets of the simplex 07 are afghikl, fghijkl, aghijkl, afhijkl, afgijkl, afghjkl, 
afghijl, and afghijk. 

(1) The facet afghikl is also a facet of the simplex 05. 

(2) The facet fghijkl is contained in the facet of B4 defined by xi^ = 0. 
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(3) The facet aghijkl is also a facet of the simplex 026 • 

(4) The facet afhijkl is also a facet of the simplex 022 • 

(5) The facet afgijkl is contained in the facet of B4 defined by X2,i = 0. 

(6) The facet afghjkl is contained in the facet of B4 defined by x^p = 0. 

(7) The facet afghijl is contained in the facet of B4 defined by = 0. 

(8) The facet afghijk is also a facet of the simplex 025. 

The facets of the simplex ug are acdefik, cdefikm, adefikm, acefikm, acdfikm, 
acdeikm, acdefkm, and acdefim. 

(1) The facet acdefik is also a facet of the simplex 0-3. 

(2) The facet cdefikm is contained in the facet of B4 defined by 23,3 = 0. 

(3) The facet adefikm is also a facet of the simplex erg. 

(4) The facet acefikm is contained in the facet of B4 defined by £3,4 = 0. 

(5) The facet acdfikm is also a facet of the simplex a±o- 

(6) The facet acdeikm is contained in the facet of B4 defined by = 0. 

(7) The facet acdefkm is contained in the facet of B4 defined by ^2,3 = 0. 

(8) The facet acdefim is contained in the facet of B4 defined by £2,2 = 0. 

The facets of the simplex ag are adefhik, adefikm, defhikm, aefhikm, adfhikm, 
adehikm, adefhkm, and adefhim. 

(1) The facet adefhik is also a facet of the simplex a 4. 

(2) The facet adefikm is also a facet of the simplex a%. 

(3) The facet defhikm is contained in the facet of B4 defined by £3,3 = 0. 

(4) The facet aefhikm is also a facet of the simplex oyi- 

(5) The facet adfhikm is also a facet of the simplex a±\. 

(6) The facet adehikm is also a facet of the simplex 014. 

(7) The facet adefhkm is contained in the facet of B4 defined by £2,3 = 0. 

(8) The facet adefhim is contained in the facet of B4 defined by ^2,2 = 0. 

The facets of the simplex a±o are acdfikm, cdfijkm, adfijkm, acfijkm, acdijkm, 
acdfjkm, aedfijm, and aedfijk. 

(1) The facet acdfikm is also a facet of the simplex as- 

(2) The facet cdfijkm is contained in the facet of B4 defined by £3,3 = 0. 
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(3) The facet adfijkm is also a facet of the simplex an. 

(4) The facet acfijkm is contained in the facet of B4 defined by x^,4 = 0. 

(5) The facet acdijkm is contained in the facet of B4 defined by 21,3 = 0. 

(6) The facet acdfjkm is also a facet of the simplex 015. 

(7) The facet acdfijm is contained in the facet of B4 defined by 2:2,2 = 0. 

(8) The facet acdfijk is also a facet of the simplex 023. 

The facets of the simplex o\\ are adfhikm, adfijkm, dfhijkm, afhijkm, adhijkm, 
adfhjkm, adfhijm, and adfhijk. 

(1) The facet adfhikm is also a facet of the simplex 09. 

(2) The facet adfijkm is also a facet of the simplex 010. 

(3) The facet dfhijkm is contained in the facet of B4 defined by 23,3 = 0. 

(4) The facet afhijkm is also a facet of the simplex 020- 

(5) The facet adhijkm is also a facet of the simplex 021. 

(6) The facet adfhjkm is also a facet of the simplex 010 . 

(7) The facet adfhijm is contained in the facet of B4 defined by 22,2 = 0. 

(8) The facet adfhijk is also a facet of the simplex 027. 

The facets of the simplex 012 are aefhikl, aefhikm, efhiklm, afhiklm, aehiklm, 
aefiklm, aefhklm, and aefhilm. 

(1) The facet aefhikl is also a facet of the simplex 05. 

(2) The facet aefhikm is also a facet of the simplex 09. 

(3) The facet efhiklm is contained in the facet of B4 defined by 2^2 = 0. 

(4) The facet afhiklm is also a facet of the simplex a\g- 

(5) The facet aehiklm is also a facet of the simplex 014. 

(6) The facet aefiklm is contained in the facet of B4 defined by 23,4 = 0. 

(7) The facet aefhklm is contained in the facet of B4 defined by 22,3 = 0. 

(8) The facet aefhilm is also a facet of the simplex 017. 

The facets of the simplex 013 are deghikl, eghiklm, dghiklm, dehiklm, degiklm, 
deghklm, deghilm, and deghikm. 

(1) The facet deghikl is also a facet of the simplex a^. 

(2) The facet eghiklm is contained in the facet of B4 defined by x\ : 2 = 0. 



(3) The facet dghiklm is contained in the facet of B4 defined by #2,4 = 0. 

(4) The facet dehiklm is also a facet of the simplex au. 

(5) The facet degiklm is contained in the facet of B4 defined by £4,2 = 0. 

(6) The facet deghklm is contained in the facet of B4 defined by ^2,3 = 0. 

(7) The facet deghilm is contained in the facet of B4 defined by x$ t i = 0. 

(8) The facet deghikm is contained in the facet of B4 defined by £3,3 = 0. 

The facets of the simplex are adehikl, adehikm, aehiklm, dehiklm, adhiklm 
adeiklm, adehklm, and adehilm. 

(1) The facet adehikl is also a facet of the simplex a§. 

(2) The facet adehikm is also a facet of the simplex 09. 

(3) The facet aehiklm is also a facet of the simplex o\i. 

(4) The facet dehiklm is also a facet of the simplex 013 . 

(5) The facet adhiklm is also a facet of the simplex 019. 

(6) The facet adeiklm is contained in the facet of B4 defined by X4 t 2 = 0. 

(7) The facet adehklm is contained in the facet of B4 defined by #2,3 = 0. 

(8) The facet adehilm is contained in the facet of B4 defined by x^,i = 0. 

The facets of the simplex CJ15 are acdfjkm, cdfjkmn, adfjkmn, acfjkmn, acdjkmn 
acdfkmn, acdfjmn, and acdfjkn. 

(1) The facet acdfjkm is also a facet of the simplex a±o- 

(2) The facet cdfjkmn is contained in the facet of B4 defined by 0:4,1 = 0. 

(3) The facet adfjkmn is also a facet of the simplex cjiq. 

(4) The facet acfjkmn is contained in the facet of B4 defined by #3,4 = 0. 

(5) The facet acdjkmn is contained in the facet of B4 defined by = 0. 

(6) The facet acdfkmn is contained in the facet of B4 defined by #2,3 = 0. 

(7) The facet acdfjmn is contained in the facet of B4 defined by 22,2 = 0- 

(8) The facet acdfjkn is contained in the facet of B4 defined by x^^ = 0. 

The facets of the simplex cr 16 are adfhjkm, adfjkmn, dfhjkmn, afhjkmn, adhjkmn 
adfhkmn, adfhjmn, and adfhjkn. 

(1) The facet adfhjkm is also a facet of the simplex an. 

(2) The facet adfjkmn is also a facet of the simplex U15. 
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(3) The facet dfhjkmn is contained in the facet of B4 defined by 2:4,1 = 0. 

(4) The facet afhjkmn is also a facet of the simplex 020 ■ 

(5) The facet adhjkmn is also a facet of the simplex 021. 

(6) The facet adfhkmn is contained in the facet of B4 defined by 2:2,3 = 0. 

(7) The facet adfhjmn is contained in the facet of B4 defined by X2,2 = 0. 

(8) The facet adfhjkn is contained in the facet of B4 defined by £3,2 = 0. 

The facets of the simplex an are aefhilm, efhilmn, afhilmn, aehilmn, aefilmn, 
aefhlmn, aefhimn, and aefhiln. 

(1) The facet aefhilm is also a facet of the simplex 012 . 

(2) The facet efhilmn is contained in the facet of B4 defined by xi,2 = 0. 

(3) The facet afhilmn is also a facet of the simplex dig. 

(4) The facet aehilmn is contained in the facet of B4 defined by 2:3,1 = 0. 

(5) The facet aefilmn is contained in the facet of B4 defined by 2:3,4 = 0. 

(6) The facet aefhlmn is contained in the facet of B4 defined by £2,3 = 0. 

(7) The facet aefhimn is contained in the facet of B4 defined by 2:2,2 = 0. 

(8) The facet aefhiln is contained in the facet of B4 defined by £4,3 = 0. 

The facets of the simplex <7ig are afhiklm, afhilmn, fhiklmn, ahiklmn, afiklmn, 
afhklmn, afhikmn, and afhikln. 

(1) The facet afhiklm is also a facet of the simplex a\2- 

(2) The facet afhilmn is also a facet of the simplex 017. 

(3) The facet fhiklmn is contained in the facet of B4 defined by 2:1,2 = 0. 

(4) The facet ahiklmn is also a facet of the simplex a\g. 

(5) The facet afiklmn is contained in the facet of B4 defined by 2:3,4 = 0. 

(6) The facet afhklmn is contained in the facet of B4 defined by #2,3 = 0. 

(7) The facet afhikmn is also a facet of the simplex <72o- 

(8) The facet afhikln is also a facet of the simplex 022. 

The facets of the simplex a\g are adhiklm, ahiklmn, dhiklmn, adiklmn, adhklmn, 
adhilmn, adhikmn, and adhikln. 

(1) The facet adhiklm is also a facet of the simplex 014. 

(2) The facet ahiklmn is also a facet of the simplex dig. 
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(3) The facet dhiklmn is contained in the facet of B4 defined by #2,4 = 0. 

(4) The facet adiklmn is contained in the facet of B4 defined by X\ : s = 0. 

(5) The facet adhklmn is contained in the facet of B4 defined by X2,3 = 0. 

(6) The facet adhilmn is contained in the facet of B4 defined by x$ t i = 0. 

(7) The facet adhikmn is also a facet of the simplex 021. 

(8) The facet adhikln is also a facet of the simplex 031. 

The facets of the simplex 020 are afhijkm, afhjkmn, afhikmn, fhijkmn, ahijkmn, 
afijkmn, afhijmn, and afhijkn. 

(1) The facet afhijkm is also a facet of the simplex an- 

(2) The facet afhjkmn is also a facet of the simplex aiQ. 

(3) The facet afhikmn is also a facet of the simplex a\g. 

(4) The facet fhijkmn is contained in the facet of B4 defined by x\p = 0. 

(5) The facet ahijkmn is also a facet of the simplex 021- 

(6) The facet afijkmn is contained in the facet of B4 defined by 0:3,4 = 0. 

(7) The facet afhijmn is contained in the facet of B4 defined by £2,2 = 0. 

(8) The facet afhijkn is also a facet of the simplex 022 • 

The facets of the simplex 021 are adhijkm, adhjkmn, adhikmn, ahijkmn, dhijkmn, 
adijkmn, adhijmn, and adhijkn. 

(1) The facet adhijkm is also a facet of the simplex an- 

(2) The facet adhjkmn is also a facet of the simplex a±e. 

(3) The facet adhikmn is also a facet of the simplex 019. 

(4) The facet ahijkmn is also a facet of the simplex 020 • 

(5) The facet dhijkmn is contained in the facet of B4 defined by X2,4 = 0. 

(6) The facet adijkmn is contained in the facet of B4 defined by x\ : s = 0. 

(7) The facet adhijmn is contained in the facet of B4 defined by #2,2 = 0- 

(8) The facet adhijkn is also a facet of the simplex 0-32- 

The facets of the simplex 0-22 are afhijkl, afhikln, afhijkn, fhijkln, ahijkln, afijkln, 
afhjkln, and afhijln. 

(1) The facet afhijkl is also a facet of the simplex 07. 

(2) The facet afhikln is also a facet of the simplex oig. 
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(3) The facet afhijkn is also a facet of the simplex o^o- 

(4) The facet fhijkln is contained in the facet of B4 defined by = 0. 

(5) The facet ahijkln is also a facet of the simplex 0-30- 

(6) The facet afijkln is contained in the facet of B4 defined by 23,4 = 0. 

(7) The facet afhjkln is contained in the facet of B4 defined by 23^ = 0. 

(8) The facet afhijln is contained in the facet of B4 defined by 24,3 = 0. 

The facets of the simplex 023 are acdfijk, bcdfijk, abdfijk, abcfijk, abcdijk, abcdfjk, 
abcdfik, and abcdfij. 

(1) The facet acdfijk is also a facet of the simplex 010 . 

(2) The facet bcdfijk is contained in the facet of B4 defined by £3,3 = 0. 

(3) The facet abdfijk is also a facet of the simplex 027. 

(4) The facet abcfijk is contained in the facet of B4 defined by 22,1 = 0. 

(5) The facet abcdijk is contained in the facet of B4 defined by 21,3 = 0. 

(6) The facet abcdfjk is contained in the facet of B4 defined by #3,2 = 0. 

(7) The facet abcdfik is also a facet of the simplex 024. 

(8) The facet abcdfij is contained in the facet of B4 defined by 22,2 = 0. 

The facets of the simplex 024 are abcdfgi, acdfgik, abcdfik, bcdfgik, abdfgik, abcfgik, 
abcdgik, and abcdfgk. 

(1) The facet abcdfgi is also a facet of the simplex o\. 

(2) The facet acdfgik is also a facet of the simplex (73. 

(3) The facet abcdfik is also a facet of the simplex 023. 

(4) The facet bcdfgik is contained in the facet of B4 defined by 23,3 = 0. 

(5) The facet abdfgik is also a facet of the simplex 028 • 

(6) The facet abcfgik is contained in the facet of B4 defined by 22,1 = 0. 

(7) The facet abcdgik is contained in the facet of B4 defined by 24,2 = 0. 

(8) The facet abcdfgk is contained in the facet of B4 defined by 23,2 = 0. 

The facets of the simplex 025 are afghijk, bfghijk, abghijk, abfhijk, abfgijk, abfghjk, 
abfghik, and abfghij. 

(1) The facet afghijk is also a facet of the simplex 07. 

(2) The facet bfghijk is contained in the facet of B4 defined by 23,3 = 0. 
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(3) The facet abghijk is also a facet of the simplex 026 • 

(4) The facet abfhijk is also a facet of the simplex 027. 

(5) The facet abfgijk is contained in the facet of B4 defined by 2:2,1 = 0. 

(6) The facet abfghjk is contained in the facet of B4 defined by 2:3,2 = 0. 

(7) The facet abfghik is also a facet of the simplex 028- 

(8) The facet abfghij is contained in the facet of B4 defined by £4,3 = 0. 

The facets of the simplex 026 are aghijkl, abghijk, bghijkl, abhijkl, abgijkl, abghjkl, 
abghikl, and abghijl. 

(1) The facet aghijkl is also a facet of the simplex 0-7. 

(2) The facet abghijk is also a facet of the simplex 0-25. 

(3) The facet bghijkl is contained in the facet of B4 defined by £2,4 = 0. 

(4) The facet abhijkl is also a facet of the simplex 0-30- 

(5) The facet abgijkl is contained in the facet of B4 defined by 2:2,1 = 0. 

(6) The facet abghjkl is contained in the facet of B4 defined by x^ t 2 = 0. 

(7) The facet abghikl is also a facet of the simplex 0-29. 

(8) The facet abghijl is contained in the facet of B4 defined by 2:4,3 = 0. 

The facets of the simplex 0-27 are adfhijk, abdfijk, abfhijk, bdfhijk, abdhijk, abdfhjk, 
abdfhik, and abdfhij. 

(1) The facet adfhijk is also a facet of the simplex an. 

(2) The facet abdfijk is also a facet of the simplex 0-23. 

(3) The facet abfhijk is also a facet of the simplex 0-25. 

(4) The facet bdfhijk is contained in the facet of B4 defined by x^,3 = 0. 

(5) The facet abdhijk is also a facet of the simplex 032. 

(6) The facet abdfhjk is contained in the facet of B4 defined by 2:3,2 = 0. 

(7) The facet abdfhik is also a facet of the simplex 02s- 

(8) The facet abdfhij is contained in the facet of B4 defined by X2,2 = 0. 

The facets of the simplex 028 are abdfghi, adfghik, abdfgik, abfghik, abdfhik, bdfghik, 
abdghik, and abdfghk. 

(1) The facet abdfghi is also a facet of the simplex 0-2. 

(2) The facet adfghik is also a facet of the simplex 0-4. 
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(3) The facet abdfgik is also a facet of the simplex 024. 

(4) The facet abfghik is also a facet of the simplex 025. 

(5) The facet abdfhik is also a facet of the simplex CT27. 

(6) The facet bdfghik is contained in the facet of B4 defined by 2:3,3 = 0. 

(7) The facet abdghik is also a facet of the simplex 029. 

(8) The facet abdfghk is contained in the facet of B4 defined by £3,2 = 0. 

The facets of the simplex 029 are adghikl, abghikl, abdghik, bdghikl, abdhikl, abdgikl, 
abdghkl, and abdghil. 

(1) The facet adghikl is also a facet of the simplex a§. 

(2) The facet abghikl is also a facet of the simplex 026- 

(3) The facet abdghik is also a facet of the simplex 028 • 

(4) The facet bdghikl is contained in the facet of B4 defined by 22,4 = 0. 

(5) The facet abdhikl is also a facet of the simplex (731. 

(6) The facet abdgikl is contained in the facet of B4 defined by 24,2 = 0. 

(7) The facet abdghkl is contained in the facet of B4 defined by 23,2 = 0. 

(8) The facet abdghil is contained in the facet of B4 defined by 2:3,1 = 0. 

The facets of the simplex 030 are ahijkln, abhijkl, bhijkln, abijkln, abhjkln, abhikln, 
abhijln, and abhijkn. 

(1) The facet ahijkln is also a facet of the simplex 022 • 

(2) The facet abhijkl is also a facet of the simplex 026- 

(3) The facet bhijkln is contained in the facet of B4 defined by 22,4 = 0. 

(4) The facet abijkln is contained in the facet of B4 defined by 21,3 = 0. 

(5) The facet abhjkln is contained in the facet of B4 defined by 23,2 = 0. 

(6) The facet abhikln is also a facet of the simplex 0-31. 

(7) The facet abhijln is contained in the facet of B4 defined by 24,3 = 0. 

(8) The facet abhijkn is also a facet of the simplex 032. 

The facets of the simplex 031 are adhikln, abdhikl, abhikln, bdhikln, abdikln, abdhkln, 
abdhiln, and abdhikn. 

(1) The facet adhikln is also a facet of the simplex a\g. 

(2) The facet abdhikl is also a facet of the simplex 0-29. 



(3) The facet abhikln is also a facet of the simplex 0-30- 

(4) The facet bdhikln is contained in the facet of B4 defined by X2,4 = 0. 

(5) The facet abdikln is contained in the facet of B4 defined by #1,3 = 0. 

(6) The facet abdhkln is contained in the facet of B4 defined by x^ t 2 = 0. 

(7) The facet abdhiln is contained in the facet of B4 defined by a^i = 0. 

(8) The facet abdhikn is also a facet of the simplex 032. 

The facets of the simplex 032 are adhijkn, abdhijk, abhijkn, abdhikn, bdhijkn, abdijk 
abdhjkn, and abdhijn. 

(1) The facet adhijkn is also a facet of the simplex 021. 

(2) The facet abdhijk is also a facet of the simplex 027. 

(3) The facet abhijkn is also a facet of the simplex 0-30- 

(4) The facet abdhikn is also a facet of the simplex 1731. 

(5) The facet bdhijkn is contained in the facet of B4 defined by a?2,4 = 0. 

(6) The facet abdijkn is contained in the facet of B4 defined by X\j$ = 0. 

(7) The facet abdhjkn is contained in the facet of B4 defined by x^,2 = 0. 

(8) The facet abdhijn is contained in the facet of B4 defined by X2,2 = 0. 
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APPENDIX C 

A Counter-example to the Hirsch Conjecture 

Right before this dissertation was ready for submission, on the 10th of May 2010, Santos 
(sec [245J) announced his construction of the first counter-example to the Hirsch Conjecture 



(see Conjecture 2.0.1). In |245] , Santos describes the construction of a 43-dimensional 
polytope P with 86 facets whose diameter is strictly greater than 43. Santos' construction 
in [245J begins with a prismatoid. 

Definition C.0.1. A prismatoid is a polytope Q with two distinguished facets F\ and F<i 
(called the base facets ) so that every vertex of Q is contained in exactly one of F\ or F<i . 

Cubes and cross-polytopes are examples of prismatoids, and any pair of disjoint facets 
can be the base facets. The distance between the base facets in the dual graph G^(Q) of a 
prismatoid Q is the main combinatorial property Santos analyzes in [245 . More precisely, 

Definition CO. 2. Let Q he a d-dimensional prismatoid with base facets F\ and F2, and let 
G A (Q) be the polar graph of Q. We say that the prismatoid Q satisfies the d-step property 
(with respect to its base facets F\ and F2) if the distance from F\ to F2 in G^(Q) is at most 
d. Otherwise, we say that Q does not satisfy the ci-step property. 

In [245], Santos constructs a 5-dimensional prismatoid Q with 48 vertices that does not 
satisfy the <i-step property. The distance between the base facets F\ and F2 in the dual 
graph G A (Q) of Q is strictly greater than five: 

Theorem CO. 3 (Santos [245J). There is a 5-dimensional prismatoid Q with 48 vertices 
that does not satisfy the d-step property. The distance between the base facets F\ and F2 
of Q in the dual graph G A (Q) of Q is exactly six. 



The data files needed to verify Theorem CO. 3 are available on the web (see [177J). 
The counterexample to the Hirsch Conjecture is obtained from the 5-dimensional Santos 
prismatoid Q by repeatedly applying a polytopal construction: 
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Theorem CO. 4 (Santos [245]). Let n > 2d. If there is a d-dimensional prismatoid Q with 
n vertices that does not satisfy the d-step property (with respect to its base facets), then 
there is a (d + l)-dimensional prismatoid Q with n + 1 vertices that does not satisfy the 
d-step property (with respect to its base facets). 



Theorem CO. 4 reduces the asimpliciality s = n—2d of a prismatoid. Repeated application 
of the theorem yields a prismatoid Z whose asimpliciality s = 0, and thus the resulting 
prismatoid Z has n = 2d. Santos applies this theorem 38 times to the 5-dimensional 
prismatoid Q with 48 vertices, obtaining a 43- dimensional prismatoid Z with 86 vertices 
that does not satisfy the d-step property with respect to its base facets F\ and F<i- That 
is to say, the distance between F\ and F2 in the dual graph G A (Z) of Z is strictly greater 
than 43. The polar P = Z A of Z is a 43-dimensional polytope with 86 facets that does not 
satisfy the Hirsch Conjecture, since the distance between the vertices that are dual to F% 
and F2 in the graph G(P) of P is strictly greater than 43. 

The monumental result leads to many new questions and open problems (and renews 
interest in the Polynomial Diameter Conjecture), including the following: 

Open Problem CO. 5. What is the smallest dimension d for which the Hirsch Conjecture 
fails? 

Open Problem CO. 6. Are there d-polytopes with n facets whose diameter is greater than 
§(n — d)? Are there d-polytopes with n facets whose diameter is greater than (n — d) 2 ? 

Open Problem CO. 7. Are there 4-dimensional prismatoids without the d-step property? 
(It is easy to prove that ^-dimensional prismatoids satisfy the d-step property.) 

Open Problem CO. 8. Find explicit coordinates for the non-Hirsch A?>-dimensional polytope 



derived from Santos' prismatoid Q and Theorem CO. 4 According to Santos, the polytope 
is expected to have between 10 9 and 10 12 vertices. 
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